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SECTION  1 
INTRODUCTION 


A  requirement  of  digital  signal  processing  is  that  analog  (voltage)  signals  are  sam¬ 
pled  and  then  converted  to  sequences  of  numbers.  In  the  conventional  quantization 
methods,  a  given  signal-sample  is  converted  (by  scaling  and  rounding)  to  a  single  in¬ 
teger  whose  value  is  proportional  to  the  signal’s  voltage;  the  subsequent  processing 
manipulates  these  integers,  and  produces  a  fined  result  (another  integer)  that  again  is 
'roportional  to  the  corresponding  analog  output. 

Modern  algebra  and  number  theory  offer  many  alternatives  to  this  simple  way 
of  digitally  representing  numbers,  and  some  alternatives  lead  to  processors  that  are 
simpler,  faster,  and  easier  to  design  and  test.  In  the  residue  number  system  (RNS) 
approach,  for  example,  each  integer  is  replaced  by  a  set  of  smaller  integers,  and  then 
processing  (consisting  of  additions  and  multiplications)  is  performed  by  manipulating 
these  smaller  integers  independently  in  parallel. 

Processing  with  algebraic  integers ,  in  which  the  signal  sample  is  represented  by  a 
set  of  (typically  four  to  eight)  small  integers,  was  introduced  in  [3]  .  This  approach, 
when  combined  with  RNS  processing,  leads  naturally  to  processors  composed  of  simple 
parallel  processing  channels,  and  is  especially  well  suited  to  situations  involving  infor¬ 
mation  with  both  amplitude  and  phase.  Roughly  speaking,  algebraic  integers  can  be 
used  to  add  a  second  level  of  parallelism  to.  integer  RNS  processing.  RNS  processing 
with  algebraic  integers  is  depicted  schematically  in  figure  1. 

Two  types  of  problems  must  be  resolved  before  the  method  can  be  used  in  practice. 
First,  the  analog  samples  must  be  converted  into  the  algebraic-integer  representation, 
and  the  final  algebraic-integer  result  converted  back  to  an  analog  output.  These  are 
quantization  problems.  Second,  the  algebraic-integer  representation  must  be  converted 
into  and  out  of  the  two  levels  of  RNS  parallelism,  corresponding  to  the  outer  and  inner 
channels  in  figure  1.  These  are  RNS  conversion  problems. 

This  paper  addresses  these  quantization  and  conversion  problems  and  proposes 
solutions  that  can  be  implemented  with  current  technology.  The  solutions  can  be  pro¬ 
grammed  for  various  choices  of  algebraic  integers  and  RNSs.  Once  the  quantization 
and  conversion  structures/designs  that  are  contained  in  this  paper  are  implemented, 
subsequent  algebraic- integer  RNS  processing  can  be  completed  by  merely  developing 
the  RNS  processing  channels.  These  channels  are  the  same  as  in  integer  RNSs,  and 
the  possibility  exists,  with  some  restrictions,  of  utilizing  processing  channels  that  were 
developed  previously. 
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Figure  1.  RNS  Processing  with  Algebraic  Integers 
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One  objective  of  this  work  was  to  obtain  results  that  were  independent  of  a  particu¬ 
lar  processing  function  or  application.  Thus,  for  example,  the  performance  of  quantizers 
derived  from  the  algebraic  integers  are  characterized,  as  is  common,  in  terms  of  signal- 
to-noise  ratios  assuming  particular  probability  distributions  on  the  inputs.  On  the  other 
hand,  the  impact  on  overall  performance  of  the  requantization  from  algebraic  integers 
back  to  analog  or  digital  form  is  expected  to  be  application  and  processing  function  de¬ 
pendent.  This  paper  considers  the  effects  of  requantization  in  the  case  of  a  processing 
function  consisting  of  a  sum  of  products,  and  illustrates  the  issues  involved  for  a  55-tap 
finite  impulse  response  filter. 

The  next  section  introduces  the  algebraic-integer  and  RNS  concepts  and  notations 
that  are  needed  in  this  paper.  Section  3  treats  the  problem  of  analog-to-algebraic- 
integer  conversion.  Sections  4  and  5  deal  with  the  two  kinds  of  RNS  conversions,  which 
involve,  respectively,  integer  and  polynomial  RNSs.  Section  6  treats  the  problem  of 
converting  back  to  digital  or  analog  form.  Finally,  section  7  contains  a  description  of 
future  work  and  the  conclusion. 
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SECTION  2 


THE  ALGEBRAIC-INTEGER  NUMBER  REPRESENTATION 
AND  RESIDUE  NUMBER  SYSTEM  PROCESSING 


The  idea  of  using  algebraic  integers  to  represent  numbers  in  RNS  signal  process¬ 
ing  can  be  viewed  as  a  generalization  of  the  quadratic  residue  number  system  (QRNS) 
concept  [5]  ,  [9]  .  In  this  section,  the  algebraic-integer  number  representation  is  intro¬ 
duced.  RNS  processing  is  reviewed  and  combined  with  the  algebraic- integer  number 
representation.  A  short  history  of  algebraic-integer  processing  is  included. 


2.1  ALGEBRAIC-INTEGER  EXTENSIONS 

Algebraic  integers  are  roots  of  polynomials  that  have  integer  coefficients  with  the 
leading  coefficient  equal  to  one.  For  example,  \/2  is  an  algebraic  integer  since  it  is  a 
root  of  the  polynomial  x 2  —  2.  For  the  present  purpose,  a  general  algebraic  integer 
can  be  viewed  as  an  integer  linear  combination  of  certain  fixed  irrational  or  imaginary 
numbers.  The  introduction  that  follows  uses  a  sequence  of  examples;  a  formal  treatment 
can  be  found  in  [8]  .  The  examples  involve  both  real  and  complex  algebraic  integers. 
The  real  and  complex  examples  are  related,  and  this  relationship  will  be  used  to  reduce 
the  complexity  of  implementations  involving  in-phase  (I)  and  quadrature  (Q)  processing 
channels.  In  the  following,  Z,  Q,  R,  and  C  denote,  respectively,  the  integers,  the  rational 
numbers,  the  real  numbers,  and  the  complex  numbers.  The  imaginary  number  \f-[  is 
denoted  by  i. 

.The  example  of  the  Gaussian  integers  {a  +  bi  :  a,  b  6  Z},  which  are  denoted  by 
Z[z]  and  form  the  basis  of  QRNS,  should  be  kept  in  mind.  They  can  be  viewed  as  being 
constructed  by  adjoining  i  to  the  integers.  The  fact  that  i  satisfies  i2  =  —1,  equivalently 
that  i  is  a  root  of  x2  +  1,  is  used  to  reduce  higher  powers  of  i  when  multiplying  elements 
of  Z(i].  This  theme  is  repeated  in  each  of  the  examples  of  this  section. 


2.1.1  Adjoining  \fi 

The  first  example  involves  real  algebraic  integers.  If  y/2  is  adjoined  to  the  field 
Q,  the  field  Q(\/2)  =  {a  -f  by/ 2  :  a,  6  €  Q}  results.  The  subset  of  Q(v^)  of  interest  is 

Z[y/2]  =  {a-\-b\/ 2  :  a,b  €  Z}.  The  set  Z[y/2]  is  closed  under  addition  and  multiplication: 

(a  +  by/2)  +  (c  +  dV2)  =  (a  +  6)  +  (c  +  d)y/ 2,  (2.1) 

(a  +  by/2){c  +  dy/ 2)  =  (ac  +  2bd)  +  (ad  +  bc)y/2.  (2.2) 
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With  these  operations,  Z[>/2]  forms  a  ring.  The  elements  of  Z[\/2  ]  are  algebraic 
integers  (they  are  roots  of  quadratic  or  linear  polynomials  with  integer  coefficients  and 
leading  coefficient  equal  to  one).  For  this  example,  Z[\/2]  contains  all  the  algebraic 
integers  of  Q(\/2)  and  is  referred  to  as  the  ring  of  algebraic  integers  for  Q(  \/2).  The 
analogy  to  keep  in  mind  is  that  Z[\/2]  is  to  Q(V2)  as  the  ring  Z  is  to  the  field  Q. 

Q(\/2)  is  a  second-degree  extension  of  Q  since  the  adjoined  element,  >/2,  is  a  root 
of  an  irreducible  polynomial  of  degree  two  over  Q,  called  its  minimum  polynomial.  This 
relation  for  \/2  is  actually  used  in  (2.2)  to  obtain  an  element  of  the  form  a  +  b\/ 2  (in 
the  same  way  the  relation  i2  =  —  1  for  i  is  used  in  complex  arithmetic). 

The  elements  1  and  \/2  form  a  basis  for  Q(V2)  as  a  vector  space  over  Q.  In 
other  words,  the  elements  of  Q(  v^2)  are  uniquely  represented  by  the  form  a  +  b\/2. 
Consequently,  the  elements  of  1\\/2\  are  also  represented  uniquely  as  integer  linear 
combinations  of  1  and  y/2. 

The  ring  Z[\/2]  is  in  fact  a  dense  subset  of  R,  meaning  that  any  real  number  x 
can  be  approximated  to  any  desired  degree  of  accuracy  by  elements  of  Z[\/2].  How¬ 
ever,  to  obtain  increasing  accuracy,  the  coefficients  a  and  b  grow  in  size.  For  example, 
table  1  lists  elements  of  Z[\/2]  with  decreasing  magnitude  and  increasing  coefficient 
sizes.  Integer  multiples  of  these  points  can  be  used  to  obtain  finer  and  finer  grids  for 
approximating  elements  of  R.  These  approximations  come  from  the  convergents  of  the 
continued  fraction  expansion  of  y/2,  and  the  list  can  be  continued  to  obtain  any  desi.ed 
degree  of  accuracy. 


Table  1.  Elements  of  Z[V2)  with 
Decreasing  Magnitude 


a 

b 

a  +  b>/2 

-1 

1 

.4142 

3 

-2 

.1716 

-7 

5 

.0711 

17 

-12 

.0294 

-41 

29 

.0122 

99 

-70 

.0051 

-239 

169 

.0021 

Algebraic  integers  can  be  used  to  represent  the  numerical  quantities  in  finite 
wordlength  implementations  of  signal  processing  functions.  For  example,  elements  of 
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R  could  be  represented  by  ordered  pairs  (a,  b)  of  integers  that  correspond  to  elements 
a  +  b\/ 2  €  Z[y/2].  Equations  (2.1)  and  (2.2)  give  the  rules  of  addition  and  multiplication 
for  the  ordered-pair  representation. 

In  a  practical  implementation,  the  sizes  of  these  coefficients  must  be  constrained. 
In  particular,  for  an  integer  A/,  define  the  set  Z[\/2]m  =  {a  +  6\/2  :  a,  b  6  Z,  |a|  < 
M/2,  |6|<  M/2}.  Z[y/2]M\s  a  finite  set.  Figure  2  displays  the  25  elements  of  Z[s/2]\. 
This  case  is  indicative  of  the  general  situation  that  such  finite  collections  of  algebraic 
integers  form  nonuniform  quantizers,  as  compared  to  the  usual  uniformly  spaced  quan¬ 
tizers. 


_La _ La a_L» »  _ a_aJ »  » * _ aJ_ 

-5  -4  -3  -2  -1  0  1  2  3  «  5 


Figure  2.  Z(\/2  ]4 


2.1.2  Adjoining  e2**/8 

Whereas  the  last  section  involved  a  real  extension,  this  section  develops  a  complex 
extension,  applicable  in  the  case  of  signal  processing  with  I  and  Q  channels.  This 
example  can  be  viewed  as  a  generalization  of  the  Gaussian  integers  Z[t],  the  ring  of 
algebraic  integers  for  Q (*)  =  {a  +  bi  :  a, 6  G  Q}.  Let  u  =  e2x'/8  denote  the  primitive 
complex  eighth  root  of  unity.  Since  w4  =  -1,  w  is  a  root  of  x4  +  1,  its  minimum 
polynomial,  and  u i  is  an  algebraic  integer. 

Ifu;  is  adjoined  to  Q,  the  field  <^(w)  =  {ao+aiu;+a2W2-fa3u;3  :  a,-  €  Q,  *  =  0, 1, 2, 3} 
results.  The  representation  subset  is  Z[u\  =  {ao  -I-  a\u>  +  fl^w2  +  a^ui3  :  a,  €  Z,  i  = 
0, 1,2,3}.  Z[u;]  can  be  viewed  geometrically  as  in  figure  3  as  containing  integer  linear 
combinations  of  the  basis  vectors  1,  w,  u;2,  and  u;3.  These  four  vectors  do,  in  fact,  form 


7 


Figure  3.  Basis  Vectors  for  Z[u 


a  vector  space  basis  when  the  coefficients  are  restricted  to  lie  in  Q;  that  is,  the  elements 
of  ZM  are  represented  uniquely. 

Elements  of  Z[u;]  can  also  be  viewed  as  polynomials  in  indeterminate  ui  with  the 
additional  relation  that  u>  satisfies  the  equation  u ;4  =  —1.  Elements  of  Z[cj]  are  added 
and  multiplied  like  polynomials,  except,  in  the  case  of  multiplication,  higher  powers  of 
u>  are  reduced  using  the  relation  u;4  =  —1. 

It  is  convenient  to  represent  the  multiplication  of  elements  of  Z[u>)  in  terms  of  matri¬ 
ces  and  vectors.  Denote  the  element  ao-f  aiu;  +  a2u>2 +  <i3U/3  by  the  vector  (ao,  aj,  02,  a 3), 
and  consider  the  product 

(00,01,02,03)  *  (60,  1>2,  63)  =  (C0,C1,C2,C3). 
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The  constant  term  cq  is  computed  by  the  following  cyclic  convolution: 


co  =  ao&o  —  0163  -  0262  —  0361, 


which  in  matrix  form  is 


co  =  (00,01,02,03) 


(2.3) 


The  matrices,  which  will  be  called  coefficient  forms ,  for  the  coefficients  ci,C2,  and 
C3  axe,  respectively, 


(010  0  \ 
1  0  0  0  ] 
00  0  -1  I 
0  0-1  0  / 


(001  0  \  /0  0  0  1\ 
0  10  0  I  I  0  0  1  0  J 

100  o  ’ (01  00 
000-1/  \l  0  0  0/ 


(2.4) 


With  addition  and  multiplication  defined  as  above,  Z[u>]  forms  a  ring  and  is,  in  fact,  the 
ring  of  algebraic  integers  for  Q(u;). 

Z[u>]  is  a  dense  subset  of  C,  and  so  any  complex  number  x  +  yi  can  be  approximated 
arbitrarily  well  by  elements  of  Z[u>|.  As  in  the  first  example,  for  an  integer  M,  define 
Z[u]m  =  {ao  +  a\u  +  a^u2  +  a^u3  :  a{  €  Z,  |at (  <  M/2,i  =  0, 1,2,3}.  1[u]m  is  a 
finite  set  and  forms  a  (nonuniform)  quantizer  of  C.  The  representatives  in  Z[u>]m  are 
regarded  as  vectors  (ao,  aj,  a2, 03)  and  are  added  using  ordinary  vector  addition  and  are 
multiplied  using  (2.3)  and  (2.4). 

It  is  possible  to  picture  Z[u;]m  in  R2.  If  x  +  yi  =  (00,01,02,03),  then  since  u>  = 

y/2/2  +  *V 2/2, 

x  =  a0  +  —  (oi  -  03),  (2.5) 


V  =  02  +  —  (at  +  03).  (2.6) 

Figure  4  shows  the  81  points  of  Z[w]2.  These  points  display  eight-fold  rotational  sym¬ 
metry  since  if  (ao, ai,  02, 03)  €  Z[u]m,  then 


w(ao,  01,02,03)  =  (-03,00,01,02) 
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Figure  4.  Z[e2x*/8]2 


is  in  Z[u\m. 

Finally,  there  is  a  relationship  between  Z[\/2]  and  Z[u>].  Consider  an  element 
x  +  yi  =  (ao,«i,  02,03)  €  Z[u]  D  R.  Since  a\,  a2,  and  03  are  integers,  (2.6)  implies  that 
a  point  with  y  =  0  must  have  a2  =  0  and  ai  =  —03.  The  expression  (2.5)  for  x  becomes 

x  =  ao  —  v/2a3. 

Thus,  Z(y/2]  corresponds  to  the  real  numbers  of  Z[e2T‘/8]. 


2.1.3  Adjoining  \J2  +  s/2 


This  section  develops  a  4th  degree  real  extension.  Let  0  =  y/2  +  \/2.  The  mini¬ 
mum  polynomial  for  0  is  a:4  —  4x2  +  2.  Integer  linear  combinations  of  the  basis  vectors  1, 
9,  92,  and  9 3  yield  Z[0]  =  {ao  +  ai0  +  O202  +  O303  :  a,  6  Z,i  =  0, 1,2,3}.  Elements  of  Z[0] 
can  be  viewed  as  polynomials,  and  a0  +  a\9  -I-  atf1  -f  a303  is  denoted  by  (ao,  ai,  a2,  a3). 

When  elements  of  Z[0]  are  multiplied,  the  relation  94  =  4 92  —  2  is  used  to  re¬ 
duce  powers  of  9  above  three.  The  coefficient  forms  for  a  product  (co,ci,C2,C3)  are, 
respectively, 


( 


1 

0 

0 

0 


0  0 
0  0 
0  -2 
-2  0 

(0  0  1 
0  1  0 
1  0  4 
0  4  0 


0  N 

/° 

1 

0 

0  \ 

-2 

|  1 

0 

0 

0  1 

0 

’(  0 

0 

0 

-2 

-8/ 

\o 

0 

-2 

0  / 

0\  /0  0  0  1\ 

4  ]  [  0  0  1  0  ] 

0  ’  0  1  0  4 

14/  \1  0  4  0/ 


(2.7) 


For  example, 


(0,0, 0,1)  *(0,0, 0,1)  =  (-8,0, 14,0). 


(2.8) 


For  an  integer  Af,  define  the  set  l[9)M  ==  {a0  +  ax0  +  a292  +  a393  :  a,  €  Z,  |a.|  < 
M/2,i  =  0, 1,2, 3}.  2{9}m  is  a  finite  set  and  forms  a  nonuniform  quantizer  for  R. 

An  algebraic  integer  can  often  be  represented  using  more  than  one  basis.  This  is 
demonstrated  for  Z[fl],  The  usual  polynomial  basis,  {1,0,02,03}  will  be  denoted  by  Bi. 
The  number  \/2  —  >/2  is  in  Z [0]  since 


V2-V2  =  -3y/2+y/2  +  (>/2  +  s/2)  . 

Let  B2  =  {1,  \/2  4-  \/2,  -s/2,  \/2  —  \/2};  then  Z[0]  can  be  represented  also  in  terms  of 
the  elements  of  B2. 

This  is  most  easily  seen  in  terms  of  the  change  of  basis  matrix.  Let  (00,01,03,03) 
correspond  to  Qo-Hai0+Q202+a303,  and  let  (60,  &i>  &2, 63)  correspond  to  &0+61  \/2  +  \/2+ 
hV?  4-  03 v2  —  \/2.  The  two  representations  are  then  related  by: 


(00,01,02,03) 


/  1 

0 

-2 

\  0 


0  0  0\ 
1  0  0  | 
0  1  0  I 

-3  0  1/ 


(00,01,03,03), 
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(6o,6l,&2>&3)  =  (^0,^1,02,03) 


(2.9) 


(1  0  0  0\ 

0  1  0  0  | 

2  0  1  0  I  ■ 

0  3  0  1/ 

In  general,  an  invertible  integer  matrix  corresponds  to  a  suitable  change  of  basis  matrix 
exactly  when  the  inverse  matrix  contains  only  integers. 

There  are  two  general  remarks  that  apply  to  representing  algebraic  integers  using 
different  bases.  When  coefficient  sizes  are  restricted,  the  finite  set  Z [0]m  obtained 
depends  on  the  choice  of  basis.  In  this  example,  since  the  elements  of  B2  are  closer 
together  in  magnitude  than  the  elements  of  Bj,  there  are  more  points  clustered  around 
zero  for  the  B2  basis.  This  may  be  an  advantage  depending  on  the  density  of  the  inputs 
being  quantized. 

Secondly,  the  dynamic  range  growth  that  results  when  two  elements  of  Z[0]  are 
multiplied  depends  on  the  basis  representation.  In  this  example,  the  coefficient  forms 
for  a  product  (co,  cj,  C2,  C3)  expressed  using  the  B2  basis  are,  respectively, 

/0  0  1  0  \ 

I  0  1  0  1  ) 

11  0  0  0  ’ 

Vo  1  0  -1/ 

Now,  for  example,  (comparing  to  (2.8)) 

(0, 0, 0, 1)  *  (0, 0,0, 1)  =s  (2, 0,  -1,0),  (2.1 1) 

and  B2  yields  the  preferred  representation  from  this  standpoint. 

2.1.4  Adjoining  e2*-*/16 

This  final  example  involves  an  8th  degree  complex  extension  that  will  turn  out  to 
intersect  R  at  Z[v2  +  v^2]-  Let  u  =  e2*-'/16  denote  the  primitive  complex  16th  root  of 
unity.  The  minimum  polynomial  of  u>  is  x8  +  1.  This  example  is  very  similar  to  the 
previous  Z[e2,M/8]  example. 

Z[u>]  can  be  regarded  as  consisting  of  polynomials  of  degree  seven  with  integer 
coefficients.  The  rule  u>8  =  —  1  is  used  in  the  product  to  reduce  the  degree  of  powers  of 
u;  above  seven.  Ifx+yi  =  (ao,ai, . . .  ,07)  €  Z[u>],  since  u>  =  \/2  +  \/2/2+(\/2  —  \/2/2  )i, 
then 

z  =  ao  +  (\/2  4-  y/2/2)(ai  —  <17)  4-  (\/2/2)(a2  —  as) 

+  (v/2^72/2)(a3  -  a5),  (2.12) 


(1  0  0  0\  /0  1  0  0\ 
02001  |10  10] 

0  0  2  ol’IO  1  0  ll’ 
0002/  \0  0  1  0/ 


(2.10) 
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y  =  o.\  +  (\/2  4-  v/2/2)(<i3  4-  as)  4-  (\/2/2)(a2  4-  ae) 

4-  (\/2  —  \/2/2)(ai  +  ar).  (2.13) 

Figure  5  shows  the  6,561  points  of  Z[w]2.  These  points  have  16-fold  rotational  symmetry. 


Figure  5.  Z[e2**/16]2 


Finally,  consider  an  element  x  +  yi  =  (ao,ai,. . .  ,07)  €  ZflR.  Since  the  coefficients 
are  integers,  (2.13)  implies  a  point  with  y  =  0  must  have  04  =  0,  <17  =  —  a\,  as  =  —02, 
and  a$  =  —03.  The  expression  (2.12)  for  x  becomes 

x  =  ao  +  y/2  +  \/2  aj  -)-  \/2a2  4-  \/2  —  \/2  03.  (2.14) 

Thus,  Z[\/2  4-  \/2],  expressed  using  the  B2  =  {1,  \/2  +  \/2 ,  \/2,  \/2  —  \/2}  basis,  cor¬ 
responds  to  the  real  numbers  of  Z(e2T,/16j. 


13 


2.2  RESIDUE  NUMBER  SYSTEM  PROCESSING  WITH 
ALGEBRAIC  INTEGERS 

In  most  digital  arithmetic  circuits  there  is  an  inherent  trade-off  between  speed 
and  precision  of  calculation  due  to  factors  such  as  carry  propagation,  for  example. 
One  way  around  this  problem  is  to  perform  the  computations  using  a  residue  number 
system  (RNS).  In  this  approach,  high-dynamic-range  integer  processing  is  performed  in 
a  number  of  parallel  low-dynamic-range  channels.  For  a  comprehensive  introduction  to 
RNS  digital  signal  processing,  including  reprints  of  certain  key  papers,  see  [19]  .  This 
section  briefly  introduces  ordinary  integer  RNS  and  shows  how  these  ideas  extend  to 
the  algebraic-integer  representation. 


2.2.1  Integer  RNS  Processing 

A  residue  number  system  is  defined  by  n  positive  integers  m,  that  are  all  relatively 
prime  (that  is,  the  greatest  common  divisor  of  any  pair  of  them  is  one).  The  integers 
m,  are  referred  to  as  the  system  moduli ,  and  the  number  M, 

n 

M  -  II  m„ 

i=i 

is  called  the  system  range.  Given  an  RNS  and  an  integer  X,  the  Chinese  Remainder 
Theorem  (CRT)  states  that  X  is  represented  uniquely  up  to  a  multiple  of  M  by  its 
residues  Xj, 

xi  =  (X)mi ,  i  =  1,2, ...  ,n, 

where  the  operation  ( X)m ,  read  X  modulo  m,  indicates  taking  the  nonnegative  remain¬ 
der  of  the  quantity  X  divided  by  m.  As  a  consequence  of  the  CRT,  then  any  integer  X 
between  0  and  M  —  1,  inclusive,  can  be  reconstructed  from  its  residues  x,. 

For  an  integer  m,  the  residues  modulo  m  form  the  set  {0,  1,  . . . ,  m  —  1},  which  is 
denoted  by  Zm.  Zm  forms  a  ring  with  addition  and  multiplication  defined  for  a,  b  6  Zm 
by: 

a  +  b=  (a  +  b)m, 
ab  =  (ab) 

Some  other  notation:  for  an  integer  X  and  a  €  Zp,  X  =  a  (mod  m)  means  (.Y)m  =  a. 

In  integer  RNS  processing,  the  numerical  quantities  are  quantized  to  integers, 
perhaps  by  scaling  and  rounding,  and  reduced  modulo  the  system  moduli.  The  same 
calculation,  consisting  of  additions  and  multiplications,  is  performed  independently 
in  parallel  channels,  one  for  each  of  the  system  moduli.  The  CRT  is  then  used  to 
reconstruct  the  result,  which  is  correct  provided  the  actual  answer  is  smaller  than  the 


14 


system  range.  The  limiting  factors  on  speed  are  usually  the  input  and  output  conversion 
processes,  where  the  system  goes  into  and  comes  out  of  the  residue  representation. 


2.2.2  Algebraic-Integer  RNS  Processing 

An  algebraic  integer  may  be  naturally  represented  in  several  ways.  The  choice  of 
representation  depends  on  a  number  of  factors,  for  example,  on  the  analog  signals  being 
quantized,  or  on  the  need  to  minimize  the  dynamic  range  growth  due  to  multiplication. 
In  order  to  make  this  discussion  more  concrete,  it  is  assumed  that  the  algebraic  integers 
are  of  the  form 

ao  +  a\0  +  <12^  +  —  +  On—i^n  \  (2.15) 

where  0  has  a  minimum  polynomial  f(x)  of  degree  n.  Other  algebraic-integer  represen¬ 
tations  are  considered  in  section  5. 

Let  the  RNS  consist  of  the  primes  pi,p2,  . . . ,  pm.  This  restriction  is  being  made 
to  ease  the  exposition;  generalizations  to  arbitrary  relatively  prime  moduli  are  pointed 
out  in  section  5.  RNS  processing  with  algebraic  integers  was  depicted  schematically 
in  figure  1.  After  analog  signals  are  converted  to  algebraic  integers,  each  coefficient  of 
the  algebraic  integer  is  reduced  modulo  the  primes  in  the  RNS.  Since  the  coefficients 
of  the  algebraic- integer  representations  are  likely  to  be  smaller  than  the  primes  in  the 
RNS,  the  reduction  modulo  pi  will  usually  just  amount  to  changing  the  negative  coeffi¬ 
cients  from,  say,  2’s-complement  to  p, ’s-complement  form.  This  first  modulo  reduction, 
corresponding  to  integer  RNS,  yields  the  so-called  outer  level  of  parallelism. 

In  each  modulo  p  outer  channel*  the  algebraic  integer  (2.15)  is  represented  by  the 
vector  of  residues 

{(ao)p,  (<*i }p, . . . ,  (an-i)p).  (2.16) 

While  addition  of  these  representatives  is  performed  coefficient-by-coefficient  in  n  inde¬ 
pendent  parallel  channels,  the  multiplication  involves  a  convolution  that  requires  cross¬ 
talk  between  the  channels  (see  the  coefficient  forms  of  the  examples  in  section  2.1). 
It  is  possible,  however,  to  choose  the  prime  p  so  that  there  exists  what  amounts  to 
a  number-theoretic  transform  that  can  be  applied  to  (2.16)  to  produce  a  vector  of  n 
residues  modulo  p.  These  transformed  representatives  can  now  be  added  and  multiplied 
by  considering  each  coefficient  independently,  and  processing  is  accomplished  by  n  in¬ 
dependent  modulo  p  channels;  this  is  the  inner  level  of  parallelism ,  which  is  depicted 
in  figure  1. 

The  polynomial  version  of  the  CRT  provides  the  theoretical  basis  for  the  inner 
level  of  parallelism  in  algebraic-integer  RNS.  The  representative  (2.16)  is  viewed  as 
a  polynomial  over  Zp;  that  is,  an  element  of  the  ring  Z v[x).  But  more  is  true:  the 
indeterminate  0  satisfies  the  relation  f(9)  =  0,  so  that  (2.16)  can  be  regarded  as  an 
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element  of  the  residue  ring  Zp[x]  modulo  /(x),  which  is  denoted  by  Zp[x]//(x).  Now 
the  polynomial  version  of  the  CRT  involving  the  factors  of  f(x)  can  be  applied;  section 
5  gives  the  complete  treatment. 

It  is  useful  to  state  here  the  condition  on  the  prime  p  that  guarantees  the  existence 
of  the  inner  level  of  parallelism,  namely,  that  the  prime  p  should  be  chosen  so  that  the 
polynomial  f(x)  factors  completely  into  linear  factors  over  Zp.  For  example,  for  Z[\/2], 
f(x)  —  x2  —  2  and  Zp  should  contain  a  It  can  easily  be  checked  that  p  =  7, 

17,  23,  31,  41,  47,  71,  73,  79,  89,  97,  103,  113,  119,  and  127  are  the  suitable  primes 
with  seven  bits  or  less.  For  Z[e2*’*/8]  and  Z[e2*’'/16],  the  condition  on  p  is,  respectively, 
p  =  1  (mod  8)  and  p  =  1  (mod  16).  For  Gaussian  integers  Z[z],  the  condition  is  p  =  1 
(mod  4),  corresponding  to  the  familiar  case  of  QRNS. 


2.3  HISTORY  OF  ALGEBRAIC-INTEGER  PROCESSING 

The  idea  of  using  algebraic  integers  in  RNS  signal  processing  can  be  traced  to  eai- 
lier  work  on  QRNS.  Motivated  by  work  on  QRNS,  which  was  eventually  published  in  [5] 

,  Cozzens  and  Finkelstein,  in  January  1983,  generalized  this  idea  to  higher-degree  cy- 
clotomic  extensions.  Their  paper  [3]  focused  on  using  Z[e2x'/2  ],  cyclotomic  extensions 
of  degree  a  power  of  two,  to  compute  the  discrete  Fourier  transform.  A  subsequent 
paper  [4]  gave  a  more  complete  error  and  range  analysis  for  this  case,  including  a 
methodology  for  computing  an  achievable  upper  bound,  called  the  exac '  hound ,  on  the 
range  of  the  coefficients  in  the  case  of  a  two-stage  quantization  scheme,  vhich  is  given 
in  detail  in  section  3.2. 

It  was  clear  from  the  outset  that,  in  addition  to  having  nice  algebraic  properties, 
the  algebraic  integers  possessed  a  rich  quantization  structure.  The  paper  [6]  considered 
the  algebraic-integer  approximation  problem,  deriving  error  estimates  for  the  case  of 
Z [e2x‘/8]  in  a  region  near  the  origin.  A  subsequent  paper,  [7]  ,  outlined  an  approxi¬ 
mation  algorithm  that  was  implicit  in  the  proofs  in  [6]  .  Both  papers  made  the  point 
that  to  obtain  efficient  algebraic- integer  approximations,  scaling,  as  in  the  conventional 
approach,  and  direct  algebraic-integer  approximations  had  to  be  combined.  The  paper 
[14]  developed  an  approximation  routine  for  Z[e2r,/8j  by  treating  real  and  imaginary 
parts  separately,  an  approach  similar  to  the  one  in  section  3.3. 

Algebraic-integer  number  representations  were  used  in  [l]  (Z[\/2  ])  to  design  a  two- 
dimensional  discrete  cosine  transform,  and  in  [2]  (Z[e2T‘/16])  to  compute  the  numerically 
approximate  inverse  of  a  square  matrix  defined  over  the  complex  numbers.  The  paper 
[1]  also  independently  derived  the  exact  bound  methodology. 
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2.4  CONCLUSION 

The  algebraic-integer  number  system  was  introduced  using  four  examples:  Z[v/2  ] , 
Z[e"T'/®],  Z[\/2  +  y/2],  and  Zfe2*'/16].  In  general,  the  cyclotomic  extensions  Z[e2T'/2  ].v/ 
have  rotational  symmetries  not  enjoyed  by  other  complex  extensions,  small  multiplica¬ 
tive  dynamic  range  growth,  and  representatives  with  2r_1  coefficients — all  features  that 
make  these  extensions  attractive  for  signal  processing  applications  involving  I  and  Q 
channels.  The  real  extensions  Z[y/2]  and  Z[y/2  +  y/2]  can  be  obtained  from  Z [e2*1/8] 
and  Z[e2*'/16]  by  restricting  to  the  real  line,  a  fact  that  will  prove  useful  in  section  3.  Al¬ 
though  many  other  extensions  are  possible,  the  four  examples  introduced  are  expected 
to  play  an  important  role  in  future  algebraic-integer  implementations. 

The  algebraic-integer  representation  was  combined  with  RNS  processing.  For  each 
prime  p  of  a  suitably  chosen  RNS,  the  ordinary  outer  channel  corresponding  to  p  splits 
into  parallel  modulo  p  inner  channels,  one  for  each  coefficient  in  the  algebraic-integer 
representation.  Besides  yielding  an  increase  in  RNS  parallelism,  this  further  splitting 
makes  the  complicated  algebraic-integer  multiplication  easy  to  compute  in  parallel. 


SECTION  3 

ANALOG-TO- ALGEBRAIC-INTEGER  CONVERSION 


This  section  considers  the  problem  of  representing  analog  values  in  terms  of  alge¬ 
braic  integers.  There  are  two  types  of  values  that  need  to  be  represented:  constants 
that  are  known  beforehand,  such  as  the  coefficients  of  a  time-invariant  filter,  and  the 
varying  inputs  to  a  real-time  processor.  The  former  do  not  present  as  much  of  a  problem 
since  they  can  be  approximated  beforehand  using  exhaustive  computer  routines.  The 
problem  involving  the  latter  type  is  the  subject  of  this  section. 

Two  approaches  are  considered.  The  first  is  a  direct  analog-to-algebraic-integer 
conversion  for  real  algebraic  integers.  This  approach  (actually  two  methods  are  sug¬ 
gested)  uses  nonlinear  functions,  making  actual  hardware  implementations  difficult. 
The  second  approach,  which  also  applies  in  the  case  of  real  algebraic  integers  and  repre¬ 
sents  an  engineering  compromise,  uses  a  (conventional)  uniform  analog-to-digital  (A/D) 
converter  followed  by  a  precomputed  approximation  table,  which  performs  a  digital-to- 
algebraic-integer  conversion.  (The  digital-to-algebraic-integer  conversion  may  be  ail 
that  is  required  of  an  algebraic-integer  processor  that  is  embedded  in  a  conventional 
digital  processor.) 

The  complexity  of  the  problem  grows  if  the  inputs  are  complex,  and  a  strategy 
that  treats  separate  I  (real)  and  Q  (imaginary)  channels  is  described.  Finally,  the 
performance  of  the  different  algebraic-integer  quantizers  is  evaluated  for  the  case  of 
Z[\/2  +  y/2}. 

3.1  DIRECT  ANALOG-TO-ALGEBRAIC-INTEGER 
CONVERSION 

In  this  section,  all  algebraic  integers  considered  are  real.  Two  methods  for  di¬ 
rect  analog-to-algebraic-integer  conversion  are  presented.  The  first  uses  the  compressor 
characteristic  of  the  nonuniform  algebraic-integer  quantizer.  The  second  is  similar  to  a 
conventional  successive  approximation  A/D  converter. 


3.1.1  Compressor  Characteristic  Method 

Finite  sets  of  real  algebraic  integers  with  bounded  coefficient  sizes  generally  cor¬ 
respond  to  nonuniform  quantizers  of  R.  In  general,  nonuniform  quantization  can  be 
achieved  by  compressing  the  signal  by  a  nonuniform  compressor  characteristic  c,  by 
quantizing  the  compressed  signal  with  a  uniform  quantizer,  and  then  applying  the  in¬ 
verse  c  1  to  the  quantized  signal.  The  compressor  characteristic  c  is  also  called  the 
companding  law  (for  compressing  and  expanding)  [lOj  . 
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Figure  6  shows  the  limiting  values  of  the  compressor  characteristic  for  the  sets 
Z[ \/2]m  and  Z[\/2  +  v/2  ] a/  (using  the  nonpolynomial  B2  basis)  as  the  value  of  M 
tends  toward  infinity.  Only  the  positive  values  are  shown  (scaled  to  the  interval  [0,1]); 
the  negative  values  are  obtained  by  reflection  through  the  origin.  The  curve  for  Z[\/2] 
was  obtained  analytically  by  considering  the  density  of  points  in  Z[y/2].  It  consists  of 
a  linear  part  near  the  origin  and  a  quadratic  part  beginning  at  the  point  (\/2  —  l)2. 
The  curve  for  Z[\/2  +  \/2\  wa s  obtained  by  computer  (a  similar  analysis  for  this  case 
is  possible  but  is  much  more  complicated). 


Figure  6.  Compressor  Characteristics  for  Real  Algebraic  Integers 
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The  compressor  characteristic  cam  be  used  to  obtain  a  direct  analog- to-algebraic- 
integer  converter.  The  converter  is  based  on  the  following  property:  if  the  points 
of  1[\/2]m,  for  instance,  are  scaled  so  that  the  largest  value  becomes  1,  then  the 
compressor  characteristic  for  Z[y/2]  maps  these  points  to  (nearly)  uniformly  spaced 
points  between  —1  and  1  on  the  y-axis.  A  uniform  A/D  converter  is  constructed  using 
these  representation  levels,  except,  instead  of  associating  the  usual  digital  code  with 
each  level,  the  corresponding  algebraic-integer  code  is  used.  An  analog  input,  scaled  to 
lie  in  the  interval  [—1,  1],  is  quantized  by  first  applying  the  compressor  characteristic 
to  it  and  then  using  the  uniform  A/D  converter  on  the  compressed  signal. 

Although  a  nonlinear  function  has  been  applied  to  the  signal,  linear  signal  pro¬ 
cessing  (without  undesirable  intermodulation  products  beyond  those  that  result  from 
the  quantization  error)  is  still  possible.  This  is  because  the  algebraic-integer  codes  are 
being  used,  which  has  the  same  effect  as  applying  the  inverse  of  the  nonlinearity  to 
the  signal.  This  is  in  contrast  to  the  situation  where  a  nonlinear  function  is  applied  to 
the  signal,  such  as  the  p-law  companders  used  in  speech,  and  the  binary  codes  from 
the  uniform  A/D  converter  are  used  directly.  In  this  case,  linear  signal  processing  with 
these  codes  would  produce  unacceptable  intermodulation  products. 

Implementing  accurately  in  hardware  such  nonlinear  compressor  functions  is  diffi¬ 
cult,  and  it  is  now  more  common  to  use  a  piecewise  linear  approximation  [10]  .  The  next 
section  gives  another  strategy  for  direct  analog-to-algebraic-integer  conversion  that  also 
uses  nonlinear  functions,  but  in  this  case  ail  that  is  required  is  that  the  zero-crossings 
are  located  correctly.  This  may  prove  to  be  easier. 


3.1.2  Successive  Approximation  Method 

It  is  possible  to  translate  the  real  algebraic-integer  quantization  problem  into  a 
form  that  more  closely  resembles  conventional  quantization.  For  simplicity,  this  is 
demonstrated  in  the  case  of  Z[y/2].  Suppose,  for  example,  the  coefficients  a  and  6  of 
a  -f  by/2  €  Z[y/2\  are  2-bit  integers  in  2’s-complement  form;  that  is,  a  =  —  2Vai  4-  Vao 
and  b  =  — 2Vji  +  Vjq,  where  Vxx  =  0,  1  (so  —2,  —1,0,  and  1  are  represented  respectively 
by  VxiVx0  =  10,  11,  00,  and  01).  Note  that  the  2’s-complement  form  for  the  coefficients 
yields  an  asymmetric  representation  set,  which  is  denoted  by  Z[v/2  ]{_2,— 1,0,1}- 

Then  a  -f  by/2  can  be  rewritten  as 

-2Val+Va0-2y/2Vbl  +  y/2Vb0.  (3.1) 

Arranging  the  constants  in  descending  magnitude,  and  defining  V3,  Vo,  Vi,  and  Vo 
appropriately,  (3.1)  becomes 

-2^2  V3  -  2V2  -f  V2  Vx  +  Vo-  (3.2) 
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Table  2.  Binary  Codes  for  Z[v/2]{_2t_i,o,i} 


Binary 

Decimal 

Decision  Thresholds 

0011 

1  +  y/2  «  2.414 

(1  +2v/2)/2,oo 

0010 

\/2«  1.414 

(1  +  n/2)/2,(1  +  2v/2)/2 

0001 

1 

v/5/2,(1+n/2)/2 

0111 

-1  +  ,414 

(-1  +  x/2  )/2,  v/2 /2 

0000 

0 

(1-  V^)/2,(-l  +  V^)/2 

1011 

1  -  sjl  «  -.414 

—  1/2,  (1  —  y/2  )/2 

011 0 

—2  +  y/2  «  -.586 

(—3  +  n/2  )/2,  — 1/2 

0101 

-1 

(— 1  —  v/2)/2,(— 3  +  n/2)/2 

1010 

-n/2  «  -1.414 

(1  -  3v/2)/2,(-l  -  n/2)/2 

1001 

1  -  2y/2  %  -1.828 

(-1  -2y2)/2,(l  —  3x/2  )/2 

0100 

-2 

(-3-v/2)/2,(-l-2x/2)/2 

1111 

-1  -  n/2  «  -2.414 

(-1-3n/2)/2,(-3-n/2)/2 

1000 

-2^2  «  -2.828 

(-2-3x/2)/2,(-1  -3n/2)/2 

1110 

1 

to 

1 

5* 

ll 

1 

CO 

+ 
►— > 

(-3-3v/2)/2,(-2-3v/2)/2 

1101 

-1  -  2n/2  «  -3.828 

(—3  —  4\/2  )/2,  (—3  —  3\/2  )/2 

1100 

-2  -  2V2  %  -4.828 

-c5o,(-3-4v/2)/2 

Table  2  lists  the  16  possible  codes  for  Vj  V2  V\  Vq  ,  the  respective  decimal  values,  and 
the  respective  decision  thresholds  (which  occur  at  the  midpoints  between  successive 
representation  values — the  optimal  placement  [10]  ). 

It  is  useful  to  compare  the  representation  in  (3.2)  with  the  4-bit  2’s-complement 
representation 

-23V'3  +  2V2  +  2K, +  M,.  (3.3) 

The  magnitude  of  the  weights  in  (3.2)  and  (3.3)  are  respectively  {2\/2, 2,  \/2, 1 }  and 
{23,22,2,  1}.  Whereas  the  absolute  values  of  the  weights  in  (3.3)  are  superincreasing 
(2  >  1,  22  >  2  +  1,  and  23  >  22  4-2  +  1),  the  weights  in  (3.2)  are  not.  This  fact  accounts 
for  the  density  of  the  elements  in  Z [\/2 ]{_2,— 1,0,1}- 

The  structure  that  performs  the  conversion  for  Z[\/2]{_2, -1,0,1}  's  shown  in  figure  7. 
The  analog  input  voltage  x  enters  at  the  top  and  is  applied  to  each  of  the  four  columns. 
In  the  offset  level ,  the  voltage  is  offset  as  a  function  of  the  0-1  values  of  V3,  lA,  V\ ,  and 
Vq  by  the  weights  indicated.  For  example,  if  (V3,  V2,  Vi,  Vo)  =  (1,0, 0,0),  then  x  +  2\/2 
is  input  into  the  next  level,  called  the  functional  level.  The  values  of  the  functions  <73, 
<72 1  <71,  and  go  will  be  derived  subsequently.  Each  device  in  the  functional  level  evaluates 
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the  indicated  function  at  the  input  voltage  and  outputs  the  result  to  the  threshold  level. 
Each  device  in  the  threshold  level  outputs  the  value  0  or  1  depending  on  whether  the 
input  is  negative  or  not.  These  values,  which  correspond  to  the  V,,  are  fed  back  to  the 
offset  level. 


Figure  7.  Analog-to-Z[v/2]{_2, -1,0,1}  Converter 


The  converter  operates  like  a  conventional  successive  approximation  A/D  con¬ 
verter,  except  that  the  threshold  decisions  are  complicated  by  the  fact  that  the  weights 
are  not  superincreasing.  This  complication  is  embodied  in  the  functions  <7,,  which  are 
now  derived.  The  V3  output  only  depends  on  the  values  of  the  function  <73.  Thus,  the 
function  <73  should  be  positive  on  those  intervals  where  V3  =  1  and  negative  on  those 
intervals  where  V3  =  0.  The  following  5th  degree  polynomial  with  this  property  can  be 
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obtained  by  inspecting  table  2: 


9i{x)  =  —(x  -  (-3  -  V2)/2)(x  -  (-1  -  2v/2)/2) 

(x  ~  (~1  -  )/2)(x  -  (-l/2))(x  -(l-y/2  )/2).  (3.4) 

The  roots  of  (73  correspond  to  decision  thresholds  where  V3  changes  value. 

Once  the  value  of  V3  is  determined,  then  a  3-bit  representation  for  x  +  2\/2  V3 
involving  the  weights  —2,  \/2,  and  1  needs  to  be  determined.  If  V3  =  0,  then  the  input 
x  is  passed  unaffected  to  the  3-bit  converter  (the  part  in  figure  7  corresponding  to  V2, 
Vi,  and  Vo).  If,  on  the  other  hand,  V3  =  1,  then  x  +  2\/2  is  passed  to  the  3-bit  converter. 
In  any  case,  the  3-bit  converter  determines  V2,  V\,  and  Vo  so  that 

x  +  2y/2V3  =  -2V2  +  y/2Vl  +  Vo,  (3.5) 

determining  the  element  of  Z[\/2]  {-2,-1, 0,1  }• 

The  3-bit  converter  is  constructed  in  a  similar  fashion  using  the  weights  —2,  y/2, 
and-1.  Again,  these  weights  are  not  superincreasing  and  the  function  <72  is  nonlinear. 
The  polynomial 

g2(x)  =  -(x  -  (-2  +  y/2  )/2)(x  -  (-1  +  V5 )/2)(x  -  y/2/2)  (3.6) 

has  the  correct  properties;  that  is,  g2(x)  is  positive  when  V2  =  1,  and  g2(x)  is  negative 
when  Vo  =  0.  It  was  derived  in  the  same  way  <73  was,  except  that  only  the  weights 
—2,  \/2,  and  1  were  used.  Now  the  offset  to  the  2-bit  converter  is  0  or  2  depending  on 
whether  V2  =  0  or  1. 

Continuing  in  this  fashion,  <71  and  g2  are  determined  to  be: 

<7i(x)  =  2x-(l  +  v/2),  (3.7) 

<7o(z)  =  2x  —  1,  (3.8) 

with  an  offset  of  —  \/2  from  the  2-bit  to  the  l-bit  converter. 

With  these  definitions  of  <73,  g2,  g\,  and  go,  the  converter  in  figure  7  obtains  the 
correct  Z[\/2  ]|_2,— 1,0,1}  representative  for  any  analog  input  x.  The  structure  can  be 
clocked  or  run  asynchronously — the  analog  input  is  applied  and  the  network  is  allowed 
to  settle. 

The  accuracy  of  the  converter  does  not  depend  on  implementing  the  exact  shape 
of  the  functions  <7,;  it  depends  just  on  implementing  the  zero-crossings  of  the  functions 
correctly.  Thus,  the  values  of  the  gi  can  be  driven  to  the  rails  and  be  replaced  with 
±C,  for  a  constant  C .  Consequently,  the  accuracy  would  depend  in  large  part  on  the 
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ability  to  accurately  implement  these  characteristic  functions.  One  possible  approach 
would  be  to  use  circuits  similar  to  those  used  in  a  window  comparator.  Figure  8  plots 
<73,  <72,  <71,  and  go  after  hard-limiting  for  the  Z[v/2]{_2,-i,o,i}  example. 

Although  the  successive  approximation  method  was  illustrated  for  the  set 
Z[V5l,_  2,— 1,  oti),  it  can  be  applied  to  other  cases.  However,  the  complexity  grows 
rapidly  as  either  the  number  of  bits  or  the  degree  of  the  extension  is  increased.  The 
number  of  zero-crossings  required  by  each  function  in  the  functional  level  is  a  good 
measure  of  the  complexity  of  the  method.  The  example  had  complexity  (5,  3,  1,  1).  No 
attempt  was  made  to  evaluate  the  complexity  of  the  method  in  general. 


3.2  TWO-STAGE  ANALOG-TO-ALGEBRAIC-INTEGER 
CONVERSION 

An  analog-to-algebraic-integer  converter  can  be  constructed  using  a  conventional 
A/D  converter  that  is  followed  by  a  table  of  real  algebraic-integer  approximations.  A 
conventional  A/D  converter  is  selected  that  meets  the  speed  requirements  of  the  proces¬ 
sor,  and  that  has  more  accuracy  than  the  processor  would  ever  require.  The  uniformly 
spaced  representation  values,  corresponding  to  the  digital  codes,  are  approximated  by 
algebraic  integers  beforehand,  using  exhaustive  computer  routines.  These  approxima¬ 
tions  are  stored  in  tables,  which  are  accessed  by  the  digital  codes. 

Let  Z[6\  denote  the  set  of  algebraic- integer  representatives.  Increasing  levels  of 
accuracy  are  achieved  by  approximating  thq  uniform  representation  values  using  the 
points  of  Z [0]m  for  increasing  values  of  the  integer  M.  Each  element  of  Z[0]m  could  be 
associated  with  many  digital  codes  (if  M  is  small)  or  none  at  all  (for  larger  M).  The 
ultimate  accuracy  is  of  course  limited  by  the  accuracy  of  the  A/D  converter. 

This  approach  is  straightforward,  but  yields  suboptimal  quantization-error  per¬ 
formance.  For  a  fixed  integer  M,  the  representation  values  of  a  two-step  quantizer 
form  a  subset  of  Z[0]m .  The  decision  thresholds  of  the  quantizer,  however,  are  those 
of  the  uniform  quantizer  implemented  by  the  A/D  converter.  These  decision  thresh¬ 
olds  will  certainly  not  be  located  at  the  midpoints  of  the  intervals  between  successive 
representation  values,  which  is  a  necessary  condition  for  optimality.  Compare  this  to 
the  direct  algebraic-integer  quantizer,  where  the  representation  values  consist  of  all  the 
elements  of  Z[0\m  ,  and  the  decision  thresholds  are  set  at  the  midpoints  between  succes¬ 
sive  representation  values.  Section  3.4  quantifies  the  loss  in  performance  for  the  case  of 

Z  (v'2  +  n/2J. 
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3.3  COMPLEX  ALGEBRAIC-INTEGER  QUANTIZATION 


Complex  algebraic- integer  quantizers  correspond  to  two-dimensional  vector  quan¬ 
tizers  in  R2.  As  such,  the  encoding  problem  becomes  much  more  difficult.  One  obvious 
approach  is  to  quantize  the  real  and  imaginary  parts  of  the  complex  number  separately, 
combining  these  representations  to  form  the  complex  algebraic-integer  representative. 
This  approach  will  be  illustrated  for  the  ring  Z[e2T'/16],  the  real  numbers  of  which  form 
the  ring  Z[\/2  4-  \fl\.  Let  ui  =  e2T,//16. 

Suppose  z  =  x+yi  is  a  complex  number  to  be  approximated.  First,  approximations 
for  the  real  numbers  x  and  y  in  Z[y/2  +  \/2]\f  (represented  using  the  nonpolynomial 
basis)  are  computed: 

£  =  uo  +  ai%/2  +  v/2  +  aty/2  +  U3V^2  —  \/2,  (3.9) 

j/  =  f>0  +  &l\/2  +  y/2  4-  b 2  \/2  +  63  \J2  —  \/2.  (3.10) 

Then  z  =  x  +  yi  is  an  approximation  of  z  in  Z[u]. 

However,  i  is  represented  in  terms  of  the  basis 

B2  =  jl,  y/2~+y/2,  \/2,  \/2~— i,  \/2 1 ,  (3.11) 

instead  of  the  polynomial  basis  Bi  =  {1  ,w, . . .  ,u;7}.  The  change  of  basis  matrix  from 
B2  to  Bi  is 

(l  0  0  0  0  0  0  0\ 


0  0  0  1  0  -1 


0  0  0  1  0 
0  0  10  0 


(3.12) 


Thus,  when  expressed  in  terms  of  the  polynomial  basis,  z  is  in  Z [w]2m- 
The  inverse  of  (3.12)  is 


/I  0  0  0  0  0  0  0\ 

oi  0  0000^ 

00  \  000^0 

00  0  £0^00 

0  0  0  0  1  0  0  0 

00  o-ioioo 
0  0  0  00^0 
\0  -i  0  0  0  0  0  i/ 


(3.13) 
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The  fractions  in  the  second  matrix  indicate  that  B2  does  not  generate  all  of  Z[o;];  B2  is 
not  an  integral  basis  of  Z[u\.  In  fact,  if  Z[B2]a/  denotes  the  points  that  are  represented 
in  terms  of  the  elements  of  B2  with  coefficients  in  the  range  [— M/2,  M/2],  then,  as  a 
subset  of  R2,  Z[B2] Af  is  the  cross  product 

Z[B 2]m  =  Z[^2  +  y/2]U  x  Z[V2  +  V2]m 

=  {(a,4);a,66Z[v/2  +  v!]M})  (3.14) 

and  this  rectangular  cross  product  is  a  proper  subset  of  Z{uj\2M . 

No  rigorous  study  was  made  of  the  decrease  in  performance  incurred  when  real  and 
imaginary  parts  are  approximated  separately  by  the  above  technique.  It  is  reasonable 
to  expect,  however,  that  the  error  performance  of  Z[u >\m  in  R2  is  approximated  by  the 
error  performance  of  Z[\/ 2  +  a/2] A/  in  R.  (Indeed,  [6]  showed  that  this  is  the  case 
for  Z[e2jr‘/8]  and  Z[y/2]  for  a  region  of  R2  near  the  origin.)  Thus,  it  is  expected  that 
the  error  performance  for  Z[uj\m  should  be  roughly  comparable  to  Z[B2]a/.  But  the 
representatives  obtained  lie  in  Z[u\2M  ,  so  the  cost  of  the  increase  in  efficiency  is,  if  these 
assumptions  are  correct,  just  the  doubling  of  the  dynamic  range  of  the  inputs. 

If  doubling  the  dynamic  range  of  the  inputs  has  to  be  avoided,  then  approximations 
relative  to  the  basis  B3: 

{l,  >/2  +  V^/2,  V2/2,  \/2-y/2/2,i,  yjl  +  \fl  i/2,  x/2  i/2,  >/2- ^2i/2}  (3.15) 

can  be  computed  (the  roles  of  the  change  of  basis  matrices  (3.12)  and  (3.13)  are  in¬ 
terchanged).  However,  now,  Z(Ba]  contains  Z[u]  and  a  strategy  must  be  followed  to 
exclude  elements.  This  is  the  approach  of  [14]  ,  which  was  applied  in  the  case  Zfe2’"/8], 
but  is  equally  valid  here. 


3.4  QUANTIZATION  PERFORMANCE  FOR  Z[\/2  +  s/2] 


The  performance  of  a  quantizer  is  usually  evaluated  in  terms  of  signal-to-noise 
ratios.  More  precisely,  the  analog  signal  x  is  assumed  to  be  a  zero-mean  random  variable 
with  probability  density  function  px  and  variance  <72.  If  x  represents  the  quantized 
version  of  x,  then  the  mean  of  the  squared  error,  or  quantization-noise  variance  is  given 
by 


x)2p(t)  dt. 


(3.16) 


The  signal-to-quantization-noise  ratio  (SNR)  is  defined  as  10  logi0(cr2/cr2)  dB. 
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This  section  computes  the  SNR  for  quantizers  derived  from  Z[v2  +  \72]  using  the 
nonpolynomial  basis  {1,  \/2  +  \/2,  \/2,  \/2  —  \/2  }.  This  basis  results  in  lower  dynamic- 
range  growth  and,  if  desired,  leads  more  naturally  to  complex  representatives.  Both 
direct  and  two-stage  algebraic-integer  quantizers  are  considered  and  compared.  The 
two-stage  quantizer  is  assumed  to  employ  a  12-bit  uniform  quantizer  as  the  front  end. 

3.4.1  Uniform  Inputs 

In  this  section,  the  analog  input  x  is  assumed  to  be  uniformly  distributed  on 
the  interval  [—1,  1];  the  signal  variance  is  =  1/3.  For  comparison,  the  optimal 
uniform  quantizer,  which  has  decision  thresholds  extending  from  —1  to  1,  has  an  SNR 
of  6.02  dB/bit  [10]  .  The  algebraic- integer  quantizers  are  determined  by  specifying  the 
coefficient  range  A 4.  The  quantizers  obtained  extend  beyond  the  interval  [—1,  1]  and  so 
must  be  scaled  (in  this  case  by  a  fractional  amount)  to  best  match  the  uniform  input 
distribution.  A  similar  scaling  (by  an  easily  calculated  amount)  would  be  required  for 
the  uniform  quantizer  if  it  had  been  initially  matched  to  a  larger  variance.  In  the 
algebraic-integer  case,  the  proper  scale  factor  is  determined  by  computer  search. 

Figures  9  and  10  show  the  SNR  plotted  as  a  function  of  log10  of  the  scale  factor  for 
the  direct  and  two-stage  quantization  schemes,  respectively.  The  scale  factor  is  plotted 
using  the  logarithm  to  facilitate  relative  error  comparisons  for  nonoptimal  choices  of 
scale  factor.  These  plots  were  obtained  using  a  computer;  the  value  of  (3. 16)  can  be 
evaluated  in  closed  form  in  this  case. 

Figures  9  and  10  show  that,  for  both  methods,  the  SNR  rises  rapidly  as  the  scale 
factor  is  increased  from  a  very  small  value  (actually  the  initial  value  of  the  scale  factor 
is  .010),  reaches  a  peak  at  about  the  same  value  of  the  scale  factor  (actually  at  the  same 
value),  and  then  gradually  declines  as  the  scale  factor  is  increased  further  (the  largest 
value  of  the  scale  factor  considered  is  .550).  The  gradual  fall-off  of  the  curves  means 
that  the  quantizers  perform  near  optimally  over  a  wide  range  of  scale  factors. 

The  limiting  effect  of  the  12-bit  front  end  in  the  two-stage  method  is  evident  in 
figure  10.  The  curves  are  approaching  a  ceiling  of  72.24  dB — the  accuracy  of  the  12-bit 
uniform  quantizer.  However,  for  the  smaller  values  of  A/,  the  curves  appear  almost  to  be 
identical.  Table  3  gives  the  optimal  values  of  the  SNR  and  the  corresponding  value  of  the 
scale  factor.  For  M  <  8  (equivalently,  for  up  to  about  11  bits  of  accuracy),  the  methods 
perform  similarly.  Thus,  the  following  heuristic  is  indicated:  the  suboptimal  two-stage 
method  performs  practically  equivalent  to  the  optimal  direct  method  if  the  front-end 
A/D  converter  in  the  two-stage  method  is  chosen  with  one  bit,  or  more  conservatively 
with  two  bits,  of  accuracy  beyond  the  accuracy  requirements  of  the  processor. 

The  performance  of  the  quantizers  derived  from  the  polynomial  basis  is  within 
about  1  dB  of  the  values  in  table  3.  The  scale  factors,  however,  are  smaller,  reflecting 


29 


SK5KAL  TO  NOISE  RATO  n.  SCALE 


Figure  9.  Uniform  Inputs;  Z[\/2  +  \f2 ]m»  2  <  M  <  12; 
Nonpolynomial  Basis;  Direct  Quantizer 


the  fact  that  Z(y2  +  \/2 \m  expressed  in  terms  of  the  polynomial  basis  involves  larger 
numbers. 

It  is  useful  to  compare  the  efficiency  of  the  algebraic-integer  quantizers  with  the 
more  conventional  uniform  quantizers.  The  rate  of  a  quantizer  containing  P  points  is 
defined  as  log2  P.  With  this  definition,  ordinary  6-bit  quantizers  have  rate  b.  For  any 
quantizer  Q,  the  efficiency  relative  to  the  uniform  quantizer  is  defined  as  the  ratio 

equivalent  bit  performance  of  Q 
rate  of  Q 


(performance  of  Q  (dB))/6.02 
rate  of  Q 


(3.17) 


Table  4  lists  the  efficiencies  of  the  direct  algebraic-integer  quantizers.  Efficiencies  of 
about  90  percent  are  obtained  with  slightly  increased  efficiency  for  larger  values  of  M. 


30 


SIGNAL  TO  NOISE  RATIO  n.  SCALE 


SCALE 


Figure  10.  Uniform  Inputs;  Z[\/2  +  \/2]a/,  2  <  M  <12; 
Nonpolynomial  Basis;  Two-Stage  Quantizer 


Table  3.  Comparison  of  Direct  and  Two-Stage  Z[\/ 2  4-  V2]m 
Quantizers,  2  <  M  <  12;  Nonpolynomial  Basis;  Uniform  Inputs 


M 

Scale 

SNR  (dB) 

Direct 

Two-Stage 

2 

34.06 

4 

KB 

51.35 

51.31 

6 

61.09 

8 

KB 

68.94 

67.25 

10 

.090 

77.43 

71.09 

12 

.080 

83.45 

71.93 
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Table  4.  Efficiency  of  Z[\/ 2  +  y/2]\j,  2  <  M  <  12; 
Uniform  Inputs;  Direct  Quantizer 


M 

Bit  Perf. 

Rate 

Efficiency 

2 

5.66 

6.34 

.892 

4 

8.53 

9.29 

.918 

6 

10.09 

11.23 

.904 

8 

11.45 

12.68 

.903 

10 

12.86 

13.84 

.930 

12 

13.86 

14.80 

.937 

The  fractional  efficiencies  are,  of  course,  consistent  with  the  fact  that,  if  the  inputs  are 
uniformly  distributed,  the  uniform  quantizer  performs  the  best. 

The  algebraic  integers  form  a  distributed  representation.  For  example,  if  8- bit 
performance  is  desired,  then,  examining  table  4,  representations  with  coefficient  range 
M  =  4  can  be  used.  This  quantizer  yields  8.53  bits  of  accuracy  with  a  number  of  points 
that  corresponds  to  9.29  bits.  The  conventional  8-bit  quantizer  involves  a  single  value, 
ranging  from  —128  to  127  for  2’s-complement  representation.  The  algebraic-integer 
quantizer  involves  four  values,  each  ranging  from  —2  to  2. 

In  actual  systems,  the  variance  of  the  input  signal  is  often  not  known  exactly,  or 
is  changing  with  time  (assuming  there  is  even  a  fixed  probability  distribution).  In  such 
situations  it,  is  important  that  a  quantizer  perform  well  over  a  wide  range  of  possible 
input  variances.  To  compare  the  conventional  and  algebraic-integer  quantizers  on  this 
point,  the  efficiency  of  the  direct  algebraic-integer  quantizers  is  calculated  for  the  case 
that  the  input  is  assumed  to  be  uniformly  distributed  on  [—1,  1],  but  in  fact  the  inputs 
are  uniformly  distributed  on  [—1/2,  1/2]  (a  mismatch  in  variance  by  a  factor  of  1/4). 

The  performance  for  the  uniform  quantizer  in  this  case  degrades  by  exactly  one  bit, 
or  6.02  dB.  A  fr-bit  uniform  quantizer  on  [—1,  1]  with  inputs  from  [—1/2,  1/2]  performs 
like  a  (6—  l)-bit  quantizer.  The  performance  of  the  algebraic-integer  quantizer  is  given 
by  the  SNR  in  figure  9  at  a  value  of  twice  the  optimal  scale  factor  s0  (or  s0  +  .301 
on  the  log  scale).  These  values  for  M  =  2,  4,  . . . ,  12  are,  respectively,  31.10,  48.69, 
57.52,  65.82,  74.52  and  79.51.  Note  that  the  degradation  from  the  optimal  algebraic- 
integer  performance  given  in  table  3  averages  around  3.2  dB,  resulting  in  an  increased 
efficiency  for  this  variance  mismatch  case.  Table  5  lists  these  efficiencies,  which  now 
average  around  96  percent. 
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Table  5.  Efficiency  of  Z[\/2  +  2  <  A/  <  12;  Uniform  Inputs; 

Direct  Quantizer;  Variance  Mismatch  of  1/4 


M 

Bit  Perf. 

Rate 

Efficiency 

r  2 

6.16 

6.34 

.972 

4 

9.09 

9.29 

.978 

6 

10.55 

11.23 

.939 

8 

11.93 

12.68 

.941 

10 

13.38 

13.84 

.967 

12 

14.21 

14.80 

.960 

3.4.2  Gaussian  Inputs 

To  test  the  performance  of  the  algebraic-integer  quantizers  when  the  input  is 
nonuniformly  distributed,  this  section  assumes  the  analog  input  x  is  Gaussian  (or  nor¬ 
mally)  distributed.  Other  common  nonuniform  input  distributions,  especially  for  speech 
and  images,  are  Laplacian  and  Gamma  distributions,  but  they  will  not  be  considered 
here.  So  that  the  input  power  in  this  and  the  last  section  are  identical,  the  varance  is 
taken  as  —  1/3;  the  mean  is  still  assumed  to  be  zero.  For  reference,  table  b  usts  the 
SNR  for  the  optimum  uniform  quantizers  with  1  through  16  bits.  The  values  in  table  6 
for  1  through  8  bits  were  obtained  from  [10]  ,  page  127;  the  values  for  9  through  16  bits 
were  estimated — the  step  size  used  was  extrapolated  from  the  step  sizes  for  1  through 
8  bits  obtained  from  [10]  . 

Again,  the  algebraic-integer  quantizer  must  be  scaled  to  best  match  the  Gaussian 
input  distribution.  Figures  11  and  12  show  the  SNR  plotted  as  a  function  of  logi0  of 
the  scale  factor  for  the  direct  and  two-stage  quantization  schemes,  respectively.  These 
plots  were  obtained  using  a  computer;  the  value  of  (3.16)  in  this  case  must  be  evaluated 
numerically. 

Figures  11  and  12  show  that,  for  both  methods,  the  SNR  rises  as  the  scale  factor 
is  increased  from  a  very  small  value  (actually  the  initial  value  of  the  scale  factor  is 
.010),  reaches  a  peak  at  about  the  same  value  of  the  scale  factor,  and  then  declines 
more  gradually  as  the  scale  factor  is  increased  further  (the  largest  value  of  the  scale 
factor  considered  is  .562).  The  gradual  fall-off  of  the  curves  means  that  the  quantizers 
perform  near  optimally  over  a  wide  range  of  scale  factors. 

The  limiting  effect  of  the  12-bit  front  end  in  the  two-stage  method  is  evident  in 
figure  12.  The  curves  arc  approaching  a  ceiling  of  62.70  dB — the  accuracy  of  the  12-bit 
uniform  quantizer  for  a  Gaussian  input.  However,  for  the  smaller  values  of  A/,  the 
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Table  6.  SNR  of  Optimal  Uniform  Quantizer; 
Gaussian  Input 


Bits 

SNR  (dB) 

1 

4.40 

2 

9.25 

3 

14.27 

4 

19.38 

5 

24.57 

6 

29.83 

7 

35.13 

8 

40.34 

9 

46.03 

10 

51.54 

11 

57.10 

12 

62.70 

13 

68.33 

14 

74.00 

15 

79.68 

16 

85.39 

curves  appear  almost  to  be  identical.  Table  7  gives  the  optimal  values  of  the  SNR 
and  the  corresponding  value  of  the  scale  factor  for  the  direct  case  (the  scale  factor 
for  the  two-stage  approach  is  within  .03  of  this  value).  In  this  case,  there  is  a  slight 
performance  degradation  almost  immediately,  which  becomes  significant  by  the  time 
M  =  8.  However,  up  to  M  =  6,  which  from  examination  of  table  6  corresponds  to 
better  than  11  bits  of  accuracy,  the  performances  are  more  comparable.  This  confirms 
the  heuristic  derived  in  the  last  section:  the  suboptimal  two-stage  method  performs 
practically  equivalent  to  the  optimal  direct  method  if  the  front-end  A/D  converter  in 
the  two-stage  method  is  chosen  with  one  bit,  or  more  conservatively  with  two  bits,  of 
accuracy  beyond  the  accuracy  requirements  of  the  processor. 

It  is  more  difficult  in  this  case  to  determine  the  efficiency  of  the  algebraic-integer 
quantizers  relative  to  the  more  conventional  uniform  quantizers.  This  is  because  there 
is  no  simple  formula,  such  as  (6.02)(number  of  bits),  for  the  performance  of  the  opti¬ 
mal  uniform  quantizer  against  a  Gaussian  input.  The  equivalent  bit  performance  for 
the  algebraic-integer  quantizer  is  estimated  by  linearly  interpolating  between  the  perfor¬ 
mances  of  two  successive  bit  values  that  contain  the  performance  of  the  algebraic-integer 
quantizer.  Table  8  lists  the  efficiencies  of  the  direct  algebraic-integer  quantizers.  Effi- 
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Figure  11.  Gaussian  Inputs;  l[\J 2  +  \/2]m,  2  <  M  <  12; 
Nonpolynomial  Basis;  Direct  Quantizer 


ciencies  greater  than  1  are  consistent  with  the  fact  that  the  nonuniformly  distributed 
algebraic-integer  quantizers  better  match  the  Gaussian  input  distribution. 

For  example,  examining  table  8,  representations  with  coefficient  range  M  =  4  yield 
9.82  bits  of  accuracy  with  a  number  of  points  that  corresponds  to  9.29  bits.  These  9.29 
bits  are  distributed  among  four  coefficients,  each  ranging  from  —2  to  2. 


3.5  CONCLUSION 

Analog-to-algebraic-integer  conversion  was  considered.  The  focus  of  the  section 
was  on  real  algebraic  integers,  since  the  complex  case  is  further  complicated  by  the  need 
to  perform  vector  quantization  in  two  dimensions.  It  was  shown  how  separate  scalar 
quantization  of  the  I  and  Q  channels  could  be  combined  to  form  a  complex  algebraic- 
integer  representative,  expressed  in  terms  of  a  product  basis.  Other,  more  desirable, 
basis  representations  are  obtained  by  transforming  coefficients,  resulting  in  represen- 
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Figure  12.  Gaussian  Inputs;  Z[\/2  +  V2]m,  2  <  A1  <  12; 
Nonpolynomial  Basis;  Two-Stage  Quantizer 


Table  7.  Comparison  of  Direct  and  Two-Stage  Z[y/2  -f  y/2  )m 
Quantizers,  2  <  M  <  12;  Nonpolynomial  Basis;  Gaussian  Inputs 


M 

Scale 

SNR  (dB) 

Direct 

Two-Stage 

.3981 

31.60 

31.11 

.2512 

50.56 

49.21 

.1995 

59.47 

57.02 

8 

.1585 

65.87 

61.18 

10 

.1413 

78.43 

62.35 

12 

.1122 

82.21 

62.50 
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Table  8.  Efficiency  of  Z[\/2~4-"v/2]  a/ »  2  <  M  <  12; 
Gaussian  Inputs;  Direct  Quantizer 


M 

Bit  Perf. 

Rate 

Efficiency 

6.33 

6.34 

.998 

9.82 

9.29 

11.42 

11.23 

1.017 

8 

12.56 

12.68 

.991 

10 

14.78 

13.84 

1.068 

12 

15.44 

14.80 

1.043 

tatives  with  accuracies  that  are  conjectured  to  be  equivalent  to  accuracies  obtainable 
from  the  two-dimensional  vector  quantizers,  but  at  a  cost  of  twice  the  coefficient  range. 

Two  methods  that  perform  direct  real  algebraic-integer  conversion  were  described. 
The  first  method  required  the  implementation  of  the  exact  shape  of  a  nonlinear  function. 
The  second  method  also  involved  nonlinear  functions,  but  only  required  that  the  zero 
crossings  of  these  functions  be  implemented  accurately.  However  the  complexity  of  the 
second  method  grows  quickly. 

A  more  practical  two-stage  approach,  which  is  based  on  conventional  A/D  con¬ 
verter  technology  was  introduced.  Although’ this  approach  is  in  theory  suboptimal,  it 
is  practically  equivalent  (at  least  in  the  case  considered:  Z[\/2  +  \/2])  to  the  optimal 
direct  approach,  provided  that  the  processor  is  operated  at  one  to  two  bits  below  the 
accuracy  of  the  front-end  A/D  converter.  A  corollary  of  this  observation  is  that  further 
hardware  development  of  a  direct  analog-to-algebraic-integer  converter  is  not  warranted 
unless  the  application  requires  processing  at  accuracies  and  speeds  that  are  at  the  limits 
of  conventional  A/D  technology. 

Finally,  the  algebraic  integers  form  a  distributed  quantizer — a  single  large  repre¬ 
sentative  is  replaced  with  a  representative  consisting  of  a  vector  of  smaller  coefficients. 
For  the  same  level  of  accuracy,  the  total  cost  (measured  by  the  number  of  points) 
of  the  conventional  and  algebraic-integer  quantizers  is  comparable.  For  uniformly- 
distributed  inputs,  the  algebraic-integer  quantizers  (based  at  least  on  the  case  consid¬ 
ered:  Z[y/2  -|-  \/2])  averaged  about  91  percent  as  efficient  as  the  conventional  quan¬ 
tizers,  although  in  practice  the  robustness  of  the  algebraic-integer  quantizers  would 
tend  to  increase  this  efficiency.  For  Gaussian-distributed  inputs,  the  algebraic-integer 
quantizers  averaged  about  103  percent  as  efficient.  An  advantage  of  the  distributed 
representation  is  that  the  smaller  coefficients  reduce  the  complexity  of  the  integer-RNS 
conversion  into  and  out  of  the  outer  level  of  parallelism,  which  is  considered  next. 
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SECTION  4 


INTEGER  RNS  CONVERSION: 

THE  OUTER  LEVEL  OF  PARALLELISM 


In  an  algebraic-integer  RNS  implementation,  integer  RNS  is  applied  separately  to 
each  coefficient  of  the  algebraic  integer  to  form  the  outer  level  of  parallelism.  Con¬ 
verting  into  the  outer  level  of  parallelism  is  expected  to  be  straightforward  because  the 
distributed  algebraic-integer  representatives  have  small  coefficients,  which  are  likely  to 
be  smaller  than  the  moduli  in  the  RNS.  Converting  out  of  the  outer  level  of  paral¬ 
lelism  will  be  more  complicated,  although  the  complexity  will  be  mitigated  somewhat 
by  the  smaller  RNS  ranges,  resulting  from  smaller  input  ranges,  that  are  needed  to 
contain  the  dynamic  range  of  individual  coefficients.  As  is  the  case  for  integer  RNS, 
this  conversion  phase  could  still  very  well  be  the  bottleneck  in  an  algebraic-integer  RNS 
implementation. 

In  an  attempt  to  eliminate  this  bottleneck,  most  converters  have  either  taken 
advantage  of  specialized  sets  of  moduli  (e.g.,  the  set  {2n  —  l,2n,2n  +  1}  used  in  [16] 
and  [21]  ),  or  have  used  mixed-radix  conversion.  Unfortunately,  the  first  approach  is 
severely  limited  by  the  availability  of  appropriate  sets  of  moduli  for  algebraic- integer 
implementations,  where  the  moduli  have  to  satisfy  certain  restrictions  to  guarantee  the 
further  splitting  into  the  inner  channels.  The  second  approach,  mixed-radix  conversion, 
is  convenient  for  many  applications,  but  suffers  limitations  of  its  own.  One  of  these 
limitations  is  its  lack  of  flexibility  in  being  reprogrammed  for  different  moduli  as  it 
requires  0(n2)  modulo  adders,  which  are  not  easily  reconfigurable,  for  an  n-modulus 
RNS  conversion. 

This  section  looks  at  a  recently  proposed  method  for  rapid  output  conversion 
that  overcomes  some  of  these  difficulties,  called  fractional  representation.  The  next 
section  explicates  conversion  using  fractional  representation,  and  modifies  it  slightly  by 
replacing  lookup  tables  with  binary  multipliers.  Subsequently,  a  generalization  allowing 
adjustable  output  precision  is  developed  and  a  comparison  is  made  with  mixed-radix 
conversion.  It  will  be  shown  that  fractional-representation  conversion  is  fast,  requires  no 
custom  modulo  circuitry,  has  adjustable  output  precision  to  match  the  system  precision, 
and  is  adaptable  to  a  variety  of  moduli  combinations  even  after  being  designed  and  laid 
down  on  silicon. 
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4.1  FULL-PRECISION  OUTPUT  CONVERSION 

Given  an  RNS  {mi, m2,. . .  ,mn},  M  —  fjm,,  the  Chinese  Remainder  Theorem 
guarantees  that  any  integer  X  between  0  and  M  —  1,  inclusive,  can  be  reconstructed 
from  its  residues  X{.  This  is  done  by  using  the  formula  ([15]  ,  pp.  30-31), 


where  the  Wi  satisfy  the  equation, 


Constructing  an  output  converter  via  a  straightforward  application  of  (4.1)  is  problem¬ 
atic,  however,  as  the  large  reduction  modulo  A/  at  the  end  requires  custom  circuitry, 
which  is  more  costly  to  operate  than  an  ordinary  binary  adder  (figure  13). 


X 


Figure  13.  Residue  Decoding  by  the  Chinese  Remainder  Theorem 
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4.1.1  Fractional  Representation 


An  equivalent  way  to  write  equation  (4.1)  is1, 

*  =  £  —  <«**.•>.,  -  MK,  (4.2) 

^  771|  f 

i 

where  K  is  some  nonnegative  integer.  The  difficulty  in  using  (4.2)  lies  in  reducing  the 
sum  by  a  multiple  of  M  to  lie  in  the  range  [0,  M  —  1],  when  numbers  are  represented 
in  binary.  This  can  be  avoided  by  scaling  X  so  that  the  result  lies  in  the  range  [0,2). 
Multiplying  X  by  2/A/,  and  calling  the  result  Xa,  yields 

<«**.■>«,  -  2 K.  (4.3) 

“  771  j  * 

I 

In  this  way,  the  reduction  modulo  M  can  be  transformed  into  a  simple  discarding  of 
bits  above  the  unit  bit.  The  true  output  X  can  now  be  fully  recovered  by  multiplying 
Xa  by  M/2,  though  in  many  cases  a  scaled  version  of  the  output  X  will  suffice.  This  is 
an  important  point,  which  will  be  expanded  upon  in  section  4.2. 

The  remaining  consideration  in  computing  Xa  is  the  representation  of  the  summand 
ui  in  (4.3),  it,  =  ~7  ( WiXi)m ..  Notice  that  0  <  u,  <  2  and  the  fractional  part  of  u,  cannot 
be  represented  exactly  in  binary  unless  is  a  power  of  2.  Let  u,  be  approximated  by 
ui,  where 

Ui  =  r*«l  2"‘,  (4.4) 

and  the  ceiling  brackets  indicate  the  closest  integer  greater  than  or  equal  to  the  enclosed 
quantity  (the  following  analysis  could  be  easily  modified  to  accommodate  rounding  or 
truncation  to  arrive  at  the  approximations).  Now,  t +  1  bits  are  used  to  approximate  ut, 
with  1  bit  for  the  integer  part  and  t  bits  for  the  fractional  part.  From  equation  (4.4), 
it  can  be  seen  that  there  is  some  approximation  error  e;  such  that  =  u,  +  e^,  where 
e,  satisfies  the  inequality2  0  <  <  2~l.  Using  ui  instead  of  u,  in  (4.3)  introduces  an 

error  e  into  the  computed  value  of  Xa, 


n 

e  =  ei  <  n2~l.  (4-5) 


1  Except  where  noted,  the  following  material  up  to  (4.6)  is  condensed  from  [22]  . 

2The  authors  in  [18]  and  [22]  state  that,  once  the  RNS  is  fixed,  the  upper  bound  on  ei  can  be  made 
exact  by  an  exhaustive  search  of  all  m<  possible  fractions.  Actually,  the  exact  upper  bound  can  be  shown  to 
be 

e,  <  [l  -2m,n<’’I*+l}/mtJ  2"‘ 

where  q,  is  the  number  of  factors  of  2  contained  in  m,-. 
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Since  the  correct  possible  values  of  Xs  are  evenly  spaced  by  increments  of  2/A/,  exact 
reconstruction  is  possible,  by  rounding  down  to  the  next  correct  value,  if  the  error 
satisfies  the  inequality  e  <  2/A/.  By  the  bound  above  on  e,  the  inequality  is  satisfied 
when  n2”‘  <  2/Af,  or  equivalently 

<  >  [log2  A/n]  -  1  (4.6) 

(the  ceiling  brackets  are  included  to  guarantee  that  t ,  the  number  of  bits  representing 
the  fractional  part  of  ttj,  is  an  integer  as  required  in  any  practical  system).  This  means 
that  if  u,  is  calculated  by  a  lookup  table  accepting  xj  as  input  and  outputting  u,,  that 
lookup  table  must  store  t  +  l  =  [log2  A/n")  bits  for  each  possible  value  of  x,\ 


4. 1.1.1  Numerical  Example 

To  illustrate  how  this  procedure  works,  take  the  RNS  consisting  of  the  three  largest 
5-bit  representable  primes  —  23,  29,  and  31.  The  range  M  =  20,677  and  the  weights 
Wi  are  12,  12,  and  2,  respectively.  Suppose  X  =  9921.  Its  residues  are  xi  =  8,  x2  =  3, 
and  X3  =  1.  There  are  three  terms  in  the  sum  in  (4.3), 

Y u *  ~  23  (12  x  8^23  29  (12  x  3^29  +  5T  X  ^31  ’ 

»=i 

which  are  used  to  calculate  Xa.  As  indicated  in  (4.6),  the  u, s  are  approximated  by 
16-bit  numbers: 


0.010110010000110, 

0.011110111001100, 

0.001000010000101. 


Given  the  residues  as  input,  the  above  numbers  are  fetched  from  the  lookup  table  and 
added: 

«l  :  0.010110010000110 

u2  :  0.011110111001100 

+  u3  :  0.001000010000101 

Xs  :  0.111101011010111 


Converting  to  decimal,  Xs  =  0.959686  .  Multiplying  by  M/2  gives  the  result 

9921.72  •  •  •  ,  which  is  truncated  to  the  value  X  =  9921. 
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Notice  that  if  only  15  bits  had  been  used  for  the  approximation,  the  final  answer 
would  have  been  incorrect.  In  that  case,  the  sum  would  have  been 

0.01011001000011 
0.01111011100110 
+  0.00100001000011 

0.11110101101100  ’ 

or  Xa  =  0.959716  •  •  • ,  yielding  a  final  answer  of  9922. 

4.1.2  Modified  Fractional  Representation 

Xa  may  also  be  calculated  in  a  way  that  avoids  lookup  tables  or  RAMs  altogether. 
This  is  due  to  the  fact  that  the  inner  reduction  modulo  m,  in  (4.1)  need  not  be  per¬ 
formed,  i.e., 


This  equation  may  be  transformed,  via  the  same  process  used  previously,  into 

»  2 

Xa  =  y  — t OiXi  -  2 L,  (4.8) 

mi 

where  L  is  some  nonnegative  integer.  It  can  be  seen  that  the  summand  may  now 
be  computed  simply  by  multiplying  by  the  residue  xj.  Unfortunately,  the  first 

multiplicand  cannot  be  represented  exactly  in  binary,  unless  the  denominator  in  the 
fraction  when  reduced  to  its  simplest  form  is  a  power  of  2.  Call  the  first  multiplicand 
yi,  and  let  y,  be  an  (s  +  l)-bit  approximation  of  y,  such  that 

»<=  T2VI2-.  (4.9) 

As  0  <  yi  <  2,  the  first  bit  represents  the  integer  part  of  y,  with  the  remaining  s  bits 
representing  the  fractional  part.  The  approximation  of  y,  causes  an  additive  error  /,  to 
be  introduced  into  yj,  where3  0  <  fi  <  2“,.  Using  i>i  =  yiii  instead  of  u;  in  computing 
Xa  causes  a  cumulative  error  /, 

n  n 

/  =  Y^ftXi  <  2~a^(m,  -  1),  (4.10) 

t  I 

3The  additive  error  can  be  calculated  exactly  from  the  equation 

fi  =  [(-«'.  •  2’+1)mi  /m,]  2~\ 
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to  be  added  to  the  final  result.  By  the  argument  presented  earlier,  exact  reconstruction 
of  the  output  is  possible  if  the  error  /  is  less  than  2/M .  This  is  true  when 


n 


~1)-  2!M ’ 
t 


or  equivalently 


s  > 


log 2M^2(mi  -  1) 


-  1. 


(4.11) 


In  sum,  the  reconversion  may  be  performed  by  ordinary  binary  multipliers  that  multiply 
each  residue  x,  by  a  fixed  weight  of  s  + 1  bits.  As  the  result  has  only  s  bits  of  fractional 
accuracy,  and  as  all  bits  above  the  unit  bit  are  irrelevant,  only  the  lowest  s  +  1  bits  of 
the  product  are  needed  in  order  to  compute  Xa. 


4. 1.2.1  Numerical  Example  Revisited 

To  compare  the  above  with  the  lookup  table  implementation,  consider  the  example 
shown  previously.  In  this  instance,  there  are  again  three  terms  in  the  sum, 

3 

£«  =  <|-12)8  + (J;-12)3 +  (1-2)1, 

«=1 

used  to  calculate  Xa.  The  terms  in  parentheses  are  the  weights  y,.  By  (4.11),  these 
weights  must  be  approximated  by  numbers  of  at  least  21-bit  accuracy: 


ill  = 
V2  = 


h  = 


22° 

22° 

920 


24 

23 

-] 

29  I 

31 


2-20 

2-2° 

2-20 


1.00001011001000010111, 

0.11010011110111001100, 

0.00100001000010000101. 


The  y,  are  multiplied  by  the  residues  {8,  3,  1}  and  the  21  lowest  bits  of  each  product 
are  summed  to  form  Xa: 


y ixi  :  0.01011001000010111000 

y2x2  :  0.01111011100101100100 
-t-y3x3:  0.00100001000010000101 

Xa  :  0.11110101101010100001 
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Converting  to  decimal,  Xa  =  0.959626  •  •  • .  Seeding  up  by  M/ 2  yields  the  value 
9921.09-  •  •  ,  which  is  truncated  to  the  correct  result,  X  =  9921. 


4.2  ADJUSTABLE  PRECISION  OUTPUT  CONVERSION 

In  the  previous  section,  it  was  assumed  that  a  full-precision  integer  was  the  de¬ 
sired  output  of  the  residue-to-binary  converter.  This,  however,  is  not  always  the  case. 
Consider  for  a  moment  the  nature  of  an  integer-RNS  system.  Usually,  it  takes  some 
input(s)  upon  which  it  performs  additions  and  multiplications.  The  range  that  the 
resulting  output  can  lie  in  is  almost  certainly  larger  than  the  input  size,  and  in  most 
cases  is  considerably  larger. 

To  illustrate  this  claim,  suppose  an  FIR  filter  (a  situation  in  which  RNS  is  often 
used)  is  being  given  a  stream  of  8-bit  inputs.  Suppose  in  addition  that  there  are  128 
filter  coefficients,  each  with  8-bit  magnitude.  In  such  a  case,  the  outputs  could  be  as 
large  as  23  bits.  However,  the  number  of  significant  bits  could  range  anywhere  from 
16  bits  in  the  case  of  a  matched  filter,  to  8  bits  for  an  ordinary  bandpass  filter,  to  less 
than  that.  Even  in  the  worst  case  7  bits  of  output  precision  would  be  unnecessary  and 
could  be  discarded. 

There  is  some  reason  to  believe  that  higher  conversion  accuracy  m,  v  be  required  for 
algebraic-integer  RNS  applications,  but  it  is  not  necessarily  the  case  that  full  output 
precision  will  always  be  required  even  then  (for  example,  see  sections  6.1  and  6.2). 
The  reader  should  be  aware  that  the  amount  of  output  conversion  precision  is  a  more 
sensitive  issue  when  an  algebraic- integer  RNS,  as  opposed  to  an  ordinary  integer  RNS, 
is  employed. 

It  is  clear  from  the  preceding  rough  calculation  that  considerable  savings  might 
result  if  there  were  some  way  to  take  advantage  of  the  less-than-fuil  output  precision 
necessary  in  many  situations.  For  mixed-radix  conversion,  it  is  known  that  the  large 
adders  at  the  end  can  be  reduced  if  less  than  the  full  output  precision  is  desired,  but 
that  this  in  no  way  reduces  the  amount  of  computation  required  for  the  mixed-radix 
coefficients.  In  the  following  subsections  it  will  be  shown  that  the  hardware  required  in 
fractional-representation  conversion  scales  almost  linearly  with  the  number  of  output 
bits  desired. 


4.2.1  Fractional  Representation 

Suppose  it  is  decided  that  the  lowest  g  bits  of  the  output  from  the  residue-to-binary 
converter  are  unnecessary.  That  is,  instead  of  resolving  to  the  nearest  2/M  increment,  it 
is  decided  that  the  output  should  be  resolved  only  to  the  nearest  increment  of  29+l  /M. 
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This  implies  that  the  conversion  error  e  should  be  less  than  the  desired  resolution, 

Let  u,  be  a  (t  +  l)-bit  approximation  of  u,-,  as  before.  Then,  by  the  bound  on  e 
developed  in  equation  (4.5),  the  conversion  error  inequality  is  satisfied  when  n 2“l  < 
2 9+1  /M.  This  is  equivalent  to  requiring  that  the  following  inequality  hold: 

t>  \{\og2Mn)  -  g~\  -  l.  (4.12) 

As  the  number  of  bits  of  output  resolution  is  (log2  M)  —  g,  it  is  clear  that  the  number  of 
bits  of  accuracy  that  must  be  carried  (and  the  size  of  the  hardware  as  well)  goes  down 
linearly  with  output  resolution  bits  as  indicated  previously,  except  for  a  (log2  n)  —  1 
offset. 

Suppose  that  instead  of  discarding  the  lowest  g  bits,  the  highest  h  bits  were  thrown 
away.  This  could  occur,  for  instance,  if  it  were  known  that  the  output  didn’t  use  the 
full  range,  but  only  grew  as  large  as  some  fraction  of  M.  Specifically,  deciding  that  h 
high-order  bits  are  unnecessary  corresponds  to  knowing  that  the  integer  outputs  lie  in 
the  range  [0,2 ~hM).  Given  the  above  situation,  the  h  high-order  bits  can  be  dropped 
from  the  approximations.  Thus,  the  number  of  bits  coming  out  of  a  lookup  table  would 
only  have  to  satisfy  the  inequality 

t  >  f(log2  A/n)  —  h]  —  1.  (4.13) 

More  generally,  if  it  is  known  that  the  output  X3  will  lie  in  some  subrange  such  that 
for  nonnegative  integers  k  and  h,  Q  <  k  <  2h , 

Jfc2_fc+1  <  Xs  <  (fc  +  l)2"fc+1, 

then  the  highest  order  h  bits  of  the  output  Xs  are  unnecessary  and  may  be  dispensed 
with  in  the  approximations  of  the  summands  u,,  as  indicated  in  (4.13). 


4. 2. 1.1  Numerical  Example 

Consider  again  the  numerical  example  from  section  4.1.  Suppose  that  g  =  4.  Then, 
as  indicated  in  (4.12),  the  u,s  should  be  approximated  by  12-bit  numbers: 

2~n  =  0.01011001001, 

2~n  =  0.01111011101, 

2-11  =  0.00100001001. 


ui  = 

u2  = 

U3  = 


2n  •  — 
23 

211  ■  — 
29 

211  •  — 
31 
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Adding  these  numbers, 


ui  :  0.01011001001 

u2  :  0.01111011101 

+  u3  :  0.00100001001 

— ■  —  -■  1 

Xa  :  0.11110101111 

and  converting  to  decimal  yields  Xa  =  0.960449---.  Multiplying  by  M/2  gives 
9929.60  •  •  • ,  which  is  truncated  to  the  nearest  increment  of  29  =  16,  X  =  9920.  In 
practice,  the  last  step  of  generating  X  from  Xa  would  not  be  done.  Instead,  Xa  would 
simply  be  truncated  to  the  number  of  output  bits  desired  (presumably  log2  M  —  g  bits) 
and  the  result  would  be  the  converter  output. 


For  this  example,  t  +  1  was  16  bits  for  a  full-precision  output.  This  was  reduced  to 
12  bits  by  decreasing  the  converter  resolution.  Since  the  hardware  scales  linearly  with 
t  +  1,  this  implies  a  25%  reduction  in  hardware.  This  compares  to  a  28%  reduction 
(log2  M  vs.  log2  M  —  g)  in  the  number  of  bits  of  resolution. 


4.2.2  Modified  Fractional  Representation 


The  results  of  section  4.2.1  may  be  straightforwardly  extended  to  cover  modified 
fractional  representation  as  well.  Suppose  it  is  desired  that  the  output  Xa  be  resolved 
to  the  nearest  increment  of  2 9+1  [M.  Then  the  conversion  error  /  must  be  less  than 
that  increment.  Using  the  bound  on  /  from  equation  (4.10),  it  can  be  seen  that  this 
occurs  when 

n 

-  !)  <  2J+1/M, 

I 

which  is  equivalent  to 


s  > 


log2M  ^(m,  -  1)  )  -g 


-  I. 


(4.14) 


In  this  case,  except  for  the  offset  log 2  ]C(m*  —  l)i  the  hardware  again  scales  linearly 
with  the  fraction  of  output  precision  desired. 

In  the  same  fashion  as  in  section  4.2.1,  bits  can  be  dropped  from  the  high  end  of 
the  approximations  in  order  to  focus  in  on  smaller  subranges  of  the  output  range.  A 
subrange  of  size  h  bits  smaller  than  the  full  range  size  allows  the  number  of  bits  needed 
to  approximate  to  be  shrunk  such  that 


s  > 


log2  M^(m<  -  1) 


(4.15) 
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As  before,  only  the  least  significant  s  +  1  bits  of  the  product  yiXi  need  be  computed  to 
calculate  Xs. 


4.2.2. 1  Numerical  Example  Revisited 

Suppose  g  =  4,  as  previously.  Then  the  weights  yi  must  be  approximated  by 
numbers  of  17-bit  accuracy,  a s  indicated  in  (4.14): 


V\  = 


h  = 


h  = 


216  •  — 

2"16 

23 

216  •  — 

2"16 

29 

216  •  — 

2-ie 

31 

1.0000101100100010, 

0.1101001111011101, 

0.0010000100001001. 


Multiplying  the  yis  by  the  residues 
summed  to  form  X3: 

S/i*l  : 

fa*2  ■ 
+  fa *3  : 

X9: 


{8,3,1},  the  lowest  17  bits  of  each  product 

0.0101100100010000 

0.0111101110010111 

0.0010000100001001 

0.1 111Q101 101 10000 


are 


Converting  to  decimal,  X3  =  0.959716  •  •  • .  Scaling  by  M/2  yields  the  value  9922.03 
This  is  truncated  to  the  nearest  increment  of  16,  or  9920.  Again,  as  in  the  case  of 
ordinary  fractional  representation,  the  M/2  scaling  would  not  be  done  in  practice. 
Rather,  the  output  X3  would  be  truncated  to  the  desired  number  of  output  bits. 

Reducing  s  +  1  by  4  bits  caused  a  19%  reduction  in  hardware,  as  compared  to  a 
28%  reduction  in  output  resolution.  The  relative  hardware  reduction  is  smaller  for  the 
modified  fractional  representation,  as  the  offset  term  in  (4.14)  is  larger  than  that  in 
(4.12). 


4.3  COMPARISON  OF  CONVERSION  METHODS 

Mixed-radix  conversion  has  been  the  method  of  choice  for  RNS  output  conversion 
in  recent  years.  Therefore,  any  new  conversion  method  must  be  compared  with  it  to 
have  any  chance  of  gaining  widespread  acceptance.  Unfortunately,  direct  comparison  of 
conversion  methods  is  often  difficult  as  the  criteria  for  optimality  are  highly  dependent 
on  the  nature  of  the  rest  of  the  RNS  system.  In  addition,  such  factors  as  the  availability 
of  certain  technologies  and  hardware,  the  need  to  be  able  to  reprogram  the  converter 
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easily,  as  well  as  others,  may  play  an  important  role  in  any  comparison.  This  means  that 
absolute,  final  statements  as  to  which  method  is  better  cannot  be  made.  What  can  be 
done  is  to  compare  certain  key  features  and  highlight  various  trade-offs  in  order  to  help 
the  reader  judge  the  suitability  of  the  proposed  technique,  given  a  specific  situation. 

The  rest  of  this  section  attempts  to  do  such  a  comparison.  The  first  subsection 
briefly  compares  mixed-radix  conversion  with  ordinary  fractional  representation  con¬ 
version  by  looking  at  the  general  architectures  for  both,  and  by  comparing  converters 
designed  for  a  specific  RNS  using  the  same  technology.  The  next  subsection  shows  that 
there  are  reasons  for  using  modified  fractional  representation  above  and  beyond  mere 
area  considerations. 


4.3.1  Mixed  Radix  vs.  Fractional  Representation 

The  main  advantages  of  mixed-radix  conversion  are  that  it  requires  no  large  modulo 
M  reduction  at  the  end,  and  that  it  has  a  somewhat  regular,  pipelineable  structure. 
A  block  diagram  of  a  four-modulus  mixed-radix  converter  can  be  seen  in  figure  14. 
The  first  observation  one  could  make  about  the  structure  shown  is  that  the  area  seems 
to  divide  fairly  equally  between  the  computation  of  the  mixed-radix  coefficients  and 
the  summing  of  the  weighted  coefficients  at  the  end.  A  second  observation  is  that  the 
connectivity  is  somewhat  complex  —  there  are  several  long  wires  that  cross  others  on 
their  way  to  their  destinations. 

A  block  diagram  of  an  equivalent  fractional- representation  converter  can  be  seen  in 
figure  15.  It  is  quite  apparent  that  the  wiring  in  this  diagram  is  extremely  simple.  There 
are  no  long  lines  and  no  crossing  wires.  Another  observation  is  that  the  maximum  length 
path  in  the  mixed-radix  converter  is  longer  than  that  in  the  fractional  representation 
converter.  In  fact,  it  can  be  seen  that  all  paths  are  the  same  for  the  converter  in 
figure  15,  which  implies  that  it  is  more  efficient  in  some  sense  than  the  mixed-radix 
converter. 

There  was  much  discussion  previously  that  the  full  output  precision  of  the  par¬ 
ticular  RNS  under  consideration  was  not  usually  needed,  and  that  an  almost  linear 
reduction  in  hardware  for  the  fractional-representation  converter  could  be  achieved  by 
taking  advantage  of  this  and  eliminating  needless  bits.  What  are  the  properties  of  the 
mixed-radix  converter  given  the  same  situation?  It  can  be  shown  that  the  adders  at 
the  end  reduce  more  than  linearly  with  decreasing  output  precision  bits.  Offsetting  this 
quick  reduction,  however,  is  the  fact  that  no  reduction  is  possible  in  the  computation 
of  the  mixed-radix  coefficients.  The  reason  for  this  is  that  each  coefficient  is  needed  to 
compute  the  next  highest  one,  and  as  the  operations  involved  take  place  within  a  finite 
ring,  exact  representation  of  the  coefficients  is  always  necessary. 
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Figure  14.  Four-Modulus  Mixed-Radix  Converter 


A  better  idea  of  the  trade-offs  between  mixed-radix  and  fractional-representation 
conversion  in  VLSI  can  be  gotten  by  looking  at  a  specific  example.  In  1985,  Department 
D-82  of  the  MITRE  Corporation  designed  a  mixed-radix  converter  for  the  RNS 

M  =  {61,  53,  41,  37,  29}. 

Based  on  the  technology  used  in  that  design,  a  number  of  mixed-radix  and  ordinary 
fractional-representation  converters  were  designed  for  the  RNS  M.  with  varying  degrees 
of  output  accuracy.  It  was  found  that  the  active  VLSI  area  of  the  modified  fractional  - 
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Figure  15.  Four-Modulus  Fractional-Representation  Converter 


representation  converter  as  a  percentage  of  the  area  of  the  equivalent  mixed-radix  con¬ 
verter,  varied  from  90%  for  full  output  precision  (i.e.,  28  bits),  to  75%  for  12-bit  output 
precision,  and  approached  a  limit  of  20%  as  the  number  of  output  precision  K’ts  went 
to  zero.  This  area  advantage  occured  in  addition  to  the  regularity  advantag  of  the 
fractional  representation  architecture  mentioned  earlier. 


4.3.2  Advantages  of  Modified  Fractional  Representation 

The  full  power  of  the  fractional  representation  is  only  realized  when  it  is  modified 
to  allow  ordinary  (i.e.,  no  modulo  operations)  weighting  of  the  residues  in  the  recon¬ 
struction  formula.  Ordinary  weighting  enables  one  to  use  standard  binary  multipliers 
rather  than  modulo-specific  lookup  tables,  thereby  freeing  the  designer  from  the  task 
of  designing  a  new  converter  for  each  and  every  RNS.  Using  modified  fractional  repre- 
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sentation,  a  general  purpose  converter  can  be  built  which  can  be  programmed  for  the 
RNS  desired  merely  by  hardwiring  the  necessary  weighting  factors  to  the  inputs  of  the 
multipliers. 

A  VLSI  design  using  the  modified  fractional  representation  was  beyond  the  scope  of 
this  project.  However,  other  efforts  within  Department  D-82  have  yielded  very  nice  re¬ 
sults  in  that  direction.  A  systolic  array  that  performs  modified  fractional- representation 
conversion  was  developed,  in  which  each  array  cell  consists  of  a  small  adder  with  a  small 
amount  of  controlling  circuitry.  In  addition,  an  input  conversion  algorithm  was  found 
which  maps  directly  into  the  same  systolic  array,  yielding  a  regular,  programmable 
architecture  capable  of  RNS  I/O  conversion.  The  array  is  programmed  for  a  specific 
RNS  in  the  manner  described  above.  Since  each  cell  in  it  operates  as  fast,  if  not  faster 
than,  the  inner  RNS  operations,  the  systolic  array  converter  imposes  no  constraints  on 
the  throughput  of  the  overall  RNS  system.  A  paper  documenting  these  results  is  in 
preparation. 


4.4  CONCLUSION 

A  new  method  of  RNS  output  conversion,  fractional  representation,  has  been  ex¬ 
plained  and  a  slightly  modified  version  of  it  employing  binary  multipliers  rather  than 
lookup  tables  has  been  derived.  The  new  method  is  fast,  flexible,  and  yields  a  fairly 
simple  structure,  which  should  be  easily  realizable  in  hardware.  A  general  architec¬ 
ture  comparison,  as  well  as  a  VLSI  area  comparison  for  a  specific  example,  was  made 
between  mixed-radix  and  ordinary  fractional-representation  converters.  In  the  VLSI 
area  comparison,  the  area  of  the  fractional-representation  converter  was  always  less 
than  that  of  the  equivalent  mixed-radix  converter.  The  main  advantage  of  fractional 
representation  occurs  in  the  modified  version,  which  allows  a  general  output  converter 
to  be  designed.  Such  a  design  was  pursued  under  a  separate  effort.  A  systolic  array 
using  modified  fractional  representation  has  been  developed  which  can  also  perform 
RNS  input  conversion.  The  array  is  programmable  for  any  RNS  desired  (within  certain 
designer-chosen  limits)  and  operates  at  least  as  fast  as  the  inner  RNS  operations  that 
it  is  meant  to  be  used  with. 
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SECTION  5 


POLYNOMIAL  RNS  CONVERSION: 
THE  INNER  LEVEL  OF  PARALLELISM 


This  section  presents  an  architecture  and  VLSI  design  for  the  conversion  into 
and  out  of  the  inner  level  of  parallelism  of  an  algebraic-integer  RNS  implementation. 
The  conversion,  which  is  based  on  the  polynomial  version  of  the  Chinese  Remainder 
Theorem,  can  be  expressed  as  a  matrix  multiplication,  and  the  resulting  architecture 
will  be  based  on  the  LU  decomposition  of  the  conversion  matrix.  This  architecture 
applies  to  parallel  vector-matrix  multiplication  for  any  nonsingular  matrix,  and  not 
just  to  the  special  case  of  the  conversion  matrix. 

The  next  section  describes  the  inner-level  algebraic- integer  conversion  problem 
and  relates  it  to  LU  decompositions.  Subsequently,  the  architecture  for  the  conversion, 
called  the  LU  architecture,  is  derived  and  two  different  basic  cells  are  treated.  An 
alternative  systolic  architecture  to  perform  parallel  vector-matrix  multiplication  is  con¬ 
sidered  and  compared  with  the  LU  architecture.  Finally,  the  VLSI  design  to  perform 
the  LU  architecture  is  described. 


5.1  CONVERSION  AS  MATRIX  MULTIPLICATION 


5.1.1  Description  of  the  Conversion 

In  order  to  make  the  discussion  of  the  conversion  more  concrete,  it  is  assumed  that 
the  algebraic  integers  are  of  the  form 

ao  +  a\6  +  a2 9^  +  •  •  •  +  fln- i9n  (5-1) 

where  9  is  a  root  of  an  nth  degree  polynomial  /(x),  which  is  irreducible  over  the  integers 
and  is  called  the  minimum  polynomial  of  9.  Other  representations  for  the  algebraic 
integers  and  the  changes  that  are  needed  in  the  conversion  are  elaborated  on  at  the  end 
of  this  section. 

Let  the  RNS  be  composed  of  the  primes  pi,p2,  ■  ■  ■  ,Pm-  It  is  possible  to  extend  the 
results  of  this  section  for  some  cases  of  arbitrary  relatively  prime  moduli;  the  restriction 
of  primes  is  used  to  ease  the  exposition.  For  each  prime  p,  the  algebraic  integer  modulo  p 
must  be  converted  to  an  n-vector  of  entries  modulo  p,  where  n  is  the  degree  of  the 
extension.  Processing  is  then  performed  on  each  coordinate  independently;  this  is  the 
inner  level  of  parallelism.  Unless  otherwise  noted,  all  constants  in  this  section  will  be 
residues  modulo  some  fixed  prime  p. 
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An  algebraic  integer  of  the  form  of  (5.1)  modulo  a  prime  p  can  be  considered  as 
an  element  of  the  ring  Zp[x]//(x),  which  consists  of  polynomials  with  coefficients  in  Zp 
modulo  /(x),  i.e.,  x  satisfies  the  relation  /(x)  =  0.  The  prime  p  is  chosen  so  that 
f(x)  has  n  distinct  roots  modulo  p,  say  ao,Qi,...  ,  an_i,  and  so  /(x)  factors  as 

/(x)  =  (x-a0)(x  — ai)---(x  — an_i). 


From  the  Chinese  Remainder  Theorem  applied  to  polynomials,  there  is  the  following 
isomorphism: 

Zp[xl/(/(x))  -  Zp[x]/(x  -  a0)  x  •••  x  Zp[x]/(x  -  a„_i) 

=  Zp  x  •••  x  Zp. 


This  isomorphism  is  given  by  the  map 

<t>:  g(x)  i — ►  (ff(o-o),...  ,0(a«_i)).  (5.2) 

Each  component  of  <j>(g(x))  represents  one  of  the  inner  parallel  channels;  in  particular, 
there  is  an  n-fold  increase  in  parallelism. 

If  g(x)  is  represented  as  in  (5.1)  (with  9  replaced  by  x),  then  the  mapping  in  (5.2) 
is  the  following  modulo  p  vector-matrix  product: 


/  1 

i 

1  ^ 

ai 

<*n-l 

ai 

i 

n—  1 

=  ,g{an-i)). 

(5.3) 

W_1 

«r‘ 

^n— 1  i 

Qn-i/ 

That  is,  the  conversion  to  the  inner  level  of  parallelism  is  accomplished  by  a  matrix 
multiplication.  The  matrix  in  (5.3)  is  an  example  of  a  Vandermonde  matrix.  It  will  be 
denoted  by  V.  The  next  theorem  proves  that  the  matrix  V  is  invertible  (the  fact  that 
the  roots  are  distinct  is  needed  here).  Conversion  out  of  the  inner  levels  is  accomplished 
by  multiplication  by  V'-1. 

THEOREM  5.1.  The  matrix  V  is  invertible. 


PROOF:  For  i  =  0, . . .  ,  n  —  1,  define  the  following  polynomials: 


u,(x) 


rw*  -  ai) 

njy,(Q'  -  ®>) 
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(this  is  the  so-called  Lagrange  interpolation  method).  Then  Vi(ctj)  equals  1  if  i  =  j 
and  0  otherwise.  For  each  Uj(x),  let  Vj  be  the  row  vector  of  coefficients  of  v,(x)  written 
by  increasing  degree.  Then  VjV  =  (ui(aro), .  ■  • ,  i>i(an-i))  is  a  vector  with  a  1  in  the 
ith  spot  and  Os  everywhere  else.  The  matrix  whose  rows  are  given  by  the  v,  is  thus  the 
inverse  of  V .  | 

For  a  general  non-prime  modulus  q,  the  mapping  of  (5.2)  is  an  isomorphism  exactly 
when  the  corresponding  Vandermonde  matrix  is  nonsingular.  In  particular,  this  is  the 
case  if  the  determinant  of  the  matrix  is  a  unit  modulo  q.  A  formula  for  the  determinant 
of  a  Vandermonde  matrix  is  given  in  section  5.1.2. 

Note  that  (5.3)  is  essentially  a  transform.  Polynomial  multiplication  or  convolution 
on  the  left  side  becomes  coordinate-wise  multiplication  on  the  right  side.  Hence,  one 
advantage  of  the  inner  level  of  parallelism  is  that  it  greatly  simplifies  the  complicated 
algebraic- i nteger  mult i plication . 


5.1.2  LU  Decomposition  of  the  Vandermonde  Matrix 


The  architecture  to  perform  the  conversion  is  based  on  the  LU  decomposition  of 
the  Vandermonde  matrix  V.  A  matrix  is  LU- decomposable  if  it  can  be  factored  as 
the  product  of  a  lower-triangular  matrix  L  times  an  upper-triangular  matrix  U .  Some 
elementary  properties  of  LU  decompositions  are  given  in  appendix  A. 


Suppose  first  that  V  factors  into  two  n  x  n  matrices,  say  V  =  VjfA.  Since  V 
is  invertible,  V\  must  be  invertible.  Each  row  of  Vj-1  is  identified  with  the  vector  of 
coefficients  of  some  polynomial  (the  coefficients  written  by  increasing  degree).  With 
this  identification, 

/—  m0(x)  —\ 

Vf1  = 

\—  mn_i(x 


—  mi(z)  — 

.  .  i 


(5.4) 


If  m^  is  the  vector  of  coefficients  for  m,(x),  then  equations  (5.2)  and  (5.4)  give 


(m,(ao),...  , m,(an-i))  =  <f>(mi(x)) 

=  m,V 
=  m,  V\  Vo 
=  ith  row  of  V*. 
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That  is, 


V2 


/  mo(ao)  mo(ai) 
I  mi(ao)  mi(ai) 


m0(a„_i)  \ 
mi(an_i)  I 


\m„_i(a0)  •••  mn_i(an_i)/ 


(5.5) 


Conversely,  if  ,  mn_i(x)}  is  any  basis  for  Zp[x]  modulo  /(x),  then  the 

corresponding  matrices  V\  and  V>  from  (5.4)  and  (5.5),  respectively,  give  a  factorization 
of  V;  V\  is  a  change  of  basis  matrix  and  V2  is  the  matrix  for  the  map  in  (5.2)  in  this 
new  basis. 

When  do  V\  and  V2  give  an  LU  decomposition?  If  V\  is  lower-triangular,  then 
Vf1  is  also  lower-triangular,  and  in  particular  the  degree  of  m,(x)  is  i.  If  V2  is  upper- 
triangular,  then  mj(o'j)  is  zero  if  j  <  i,  or  equivalently,  (x  —  ctj)  divides  m,(x).  These 
observations  lead  to  the  following  definitions: 


to(x)  =  1 

ti(x)  =  x  —  qo 

t2(x)  =  (x  -  c*o)(x  -  ai) 


*»-l(*)  =  JI  (X  -  Q})  . 
}<n-r  1 


(5.6) 


Define  V\  and  V2  as  in  (5.4)  and  (5.5)  using  these  <j(x)  polynomials;  this  gives  the  LU 
decomposition  of  V .  Furthermore,  up  to  a  constant  factor,  they  are  the  only  polynomials 
which  do  so  (Theorem  A. 3).  These  matrices  are  denoted  by  L  =  V\  and  U  =  V2.  The 
LU  decomposition  of  a  Vandermonde  matrix  is  presented  in  a  different  manner  in  [9] 
along  with  additional  references. 

As  an  aside,  this  factorization  gives  for  free  the  well-known  formula  for  the  de¬ 
terminant  of  a  Vandermonde  matrix.  The  determinant  of  V  is  the  product  of  the 
determinants  of  L  and  U.  The  determinant  of  L  is  1,  since  L-1  is  lower-triangular  with 
Is  on  the  diagonal  (the  coefficient  of  x*  in  t*(x)  is  1).  Thus,  the  determinant  of  V  is  the 
determinant  of  U,  which  in  turn  is  the  product  of  its  diagonal  elements.  That  is, 

n— 1 

det(V)  =  *,(<*«)  =  JJ(ai-a_,). 

i=0  j<i 
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5.1.3  An  Example:  Z[\/2  +  \/2],  p  =  31 

Let  9  =  \/2  +  \pi.  The  minimum  polynomial  for  9  is  /(x)  =  x4  —  4x2  +  2,  and  a 
small  calculation  shows  that 

x4  —  4x2  4-  2  =  (x  —  5)(x  —  14)(x  —  17)(x  —  26)  (mod  31). 

Thus  the  isomorphism  of  (5.2)  exists.  The  corresponding  Vandermonde  matrix  is 


/  1  1  1  1  \ 

5  14  17  26  I 

I  25  10  10  25  I 

V  1  16  15  30/ 


The  inverse  of  V  can  be  found  by  expanding  the  following  four  polynomials  (where, 
for  the  remainder  of  this  section,  all  calculations  are  modulo  31): 


1 


(5  -  14)(5  -  17)(5  -  26) 
1 


(14  -  5)(14  -  17)(14  -26). 
1 

(17  —  5)(  17  —  14)(  17  —  26) 

1 


(x-  14)(x-  17)(x-26) 
(x  -  5)(x  -  17)(x  -  26) 


(x  —  5)(x  —  14)(x  —  26) 


(26  -5)(26  -  14)(26  -  17) 


(x  —  5)(x  —  14)(x-  17) 


That  is, 


V1  = 


10  2  30  6  \ 

6  27  1  20 

6  4  1  11 

10  29  30  25/ 


In  order  to  calculate  the  LU  decomposition  of  V,  define  the  four  polynomials: 


<o(x)  =  1 

t  i(x)  =  x  —  5 

t2(x)  =  (x  -  5)(x  -  14) 

<3(x)  =  (x  -  5)(x  -  14)(x 


=  1 

=  x  +  26 
=  x2  +  12x  +  8 
17)  =  x3  +  26x2  +  21x  +  19. 
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Then  V  —  LU ,  where 


/  1  0  0  0\ 

j —\  26  1  0  0  | 

L  ~  8  12  1  0  I 

\  19  21  26  1/ 

(using  the  ti(x )  in  (5.4))  and 


U 


(l  1 

[  0  9 

I  0  0 
\0  0 


1 

12 

5 

0 


1 

21 

4 

5 


(using  the  ti(x)  in  (5.5)).  These  in  turn  imply  that 


L  = 


/  1  0  0  0\ 
f  5  10  0  1 

I  25  19  1  0  I 

V  1  12  5  1/ 


and 

(1  24  29  6  \ 

0  7  8  20 

0  0  25  11  - 

0  0  0  25/ 

The  latter  two  matrices  are  given  for  completeness;  they  will  not  be  needed  in  the 
architecture  to  be  described  at  the  end  of  section  5.2. 

This  and  other  examples  are  given  in  appendix  B. 


5.1.4  Change  of  Basis 

An  algebraic  integer  may  be  naturally  represented  in  several  ways.  For  example, 
the  elements  of  the  ring  Z[\/ 2  +  \/2]  of  real  numbers  in  the  ring  of  algebraic  integers 
determined  by  the  16th  roots  of  unity  can  be  represented  as 


*y  j 

(ao,  ai ,  a2, 03)  < - ♦  ao  +  a\ y/2  +  y/2  +  02  (\/‘2  +  \/2j  +  03  (y/ 2  +  \/2j 


or 


(60,  bi,bo,  63)  ♦ - *  bn  -f  61  \/2  +  \J2  +  b2\Pl  +  63 \J 2  —  \/2  . 


58 


It  is  the  first  representation  that  coincides  with  (5.1)  and  corresponds  to  the  example 
in  section  5.1.3.  The  two  representations  are  related  by  a  change  of  basis  matrix: 


(  1  0  0  0  \ 

-2  0  1  0  ]  =  ’a3)‘ 

0  -3  0  1  / 

Call  this  change  of  basis  matrix  T.  Then  the  vector  (6o,...  ,63)  is  converted  into 
the  inner  channels  by  multiplying  by  TV,  where  V  is  the  Vandermonde  matrix 
from  (5.3).  Since  T  is  lower-triangular  and  V  is  LU-decomposable,  the  matrix  TV 
is  LU-decomposable.  Therefore,  the  architecture  to  be  developed  in  section  5.2  would 
apply  also  to  the  matrix  TV  and  thus  to  the  second  representation. 

In  general,  given  another  representation  for  the  algebraic  integers  besides  the  stan¬ 
dard  representation  of  (5.1),  there  is  a  change  of  basis  matrix  T  that  sends  this  represen¬ 
tation  into  the  standard  representation.  This  matrix  is  integral  with  determinant  ±1, 
and  so  in  particular  is  invertible  modulo  any  prime  p.  The  conversion  of  algebraic  inte¬ 
gers  in  the  new  representation  into  the  inner  channels  is  accomplished  by  multiplying 
by  TV,  where  V  is  the  Vandermonde  matrix  from  (5.3).  If  TV  is  LU-decomposable, 
then  no  additional  changes  need  be  made;  the  resulting  matrices  L  and  U  can  be  used 
in  the  architecture  of  section  5.2. 

If  TV  is  not  LU-decomposable,  then  there  is  some  permutation  matrix  P  such 
that  PTV  is  LU-decomposable  (see  Theorem  A. 2  in  appendix  A).  Multiplying  by  the 
matrix  P  induces  a  reordering  of  the  coordinates  in  the  representation,  and  thus  requires 
no  additional  overhead.  That  is,  conversion  becomes  the  calculation 

((no,-..  ,a„_i)PT)  PTV , 

where  multiplying  by  PT  just  permutes  the  a;.  The  same  observation  applies  to  recon¬ 
version. 


5.2  THE  LU  ARCHITECTURE 

In  this  section,  an  architecture  is  developed  that  performs  the  conversion  and 
reconversion  of  a  vector  via  a  matrix  multiplication.  The  architecture  applies  whenever 
the  matrix  is  LU-decomposable  and  is  called  the  LU  architecture.  The  principle  used  in 
the  computation  is  a  combination  of  forward  elimination  and  back  substitution  (see  [9) 
).  The  architecture  is  also  motivated  by  the  desire  to  have  parallel  inputs  and  outputs. 

The  LU  architecture  assumes  knowledge  of  the  LU  decomposition;  as  seen  in  the 
last  section,  it  is  possible  to  precompute  the  LU  decomposition  for  the  the  case  of 
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inner-level  conversion  and  reconversion.  In  other  applications,  determining  the  LU 
decomposition  may  be  more  the  central  issue,  and  the  present  architecture  should  not  be 
confused  with  methods  that  accomplish  this.  This  section  develops  the  LU  architecture 
in  a  general  manner,  illustrating  it  using  the  specific  task  of  inner-level  conversion  and 
reconversion. 


5.2.1  Data  Flow 


Suppose  that  the  vector  x  =  (xo,...  ,in-i)  is  converted  to  the  vector  z  = 
(20,...  ,  2»j_i)  using  an  LU-decomposabld  matrix.  Then  for  the  appropriate  lower- 
triangular  matrix  L  and  upper-triangular  matrix  U, 


xLU  =  z . 


(5.7) 


Reconversion  from  z  to  x  is  the  matrix  product 


zU-'L-1  =  x. 


(5.8) 


Define  y  =  (yo,  •  •  •  ,  yn-i)  as  xL  (=  zU  !).  This  y  is  in  some  sense  a  halfway  point  in 
the  conversion  from  x  to  z  (or  the  reconversion  the  other  way). 

Although  the  results  in  this  section  will  be  discussed  for  general  LU  decomposi¬ 
tions,  the  specific  example  of  the  Vandermonde  matrix  of  the  last  section  should  be 
kept  in  mind.  In  that  case,  the  vector  x  represents  an  algebraic  integer  in  polynomial 
notation  (5.1).  The  vector  z  represents  residues  (from  the  polynomial  version  of  the 
Chinese  Remainder  Theorem  (5.2)).  As  for  y,  it  is  the  vector  of  coefficients  of  the  U(x) 
from  (5.6).  The  ti(x)  are  products  of  an  increasing  number  of  irreducible  polynomials, 
i.e.,  the  (x  —  a^),  and  so  are  analogous  to  the  successive  products  of  primes  in  mixed- 
radix  conversion  in  ordinary-integer  RNS;  the  vector  y  then  corresponds  to  mixed-radix 
coefficients.  (This  analogy  can  be  made  rigorous;  it  is  based  on  the  similar  algebraic 
properties  of  Z  and  Zp[x].) 

The  overall  structure  of  the  LU  architecture  is  a  grid  of  basic  cells  that  uses  the 
fact  that  since  L  is  lower-triangular,  y,  depends  only  on  x,, ...  ,  x„_],  or  equivalently, 
on  Xi,yj+i,...  ,yn-i-  As  each  y,  is  calculated  by  forward  elimination,  it  is  then  fed 
into  the  calculation  of  yo, . . .  ,  yi- 1  and  2,, . . .  ,  zn_  1.  The  latter  are  computed  by  back 
substitution.  The  flow  of  variables  is  shown  in  figure  16.  Values  pass  from  top  to  bottom 
and  calculations  take  place  at  the  arrowheads.  In  particular,  the  grid  in  figure  16  is 
5  x  4.  This  process  will  be  made  more  apparent  in  the  next  subsection. 

The  reconversion  is  based  on  similar  observations  between  the  y;  and  the  2,.  Since 
U~l  is  upper-triangular,  y,  depends  on  20,...  ,2,,  i.e.,  on  2,,  yo, ...  ,  y,_  j.  Again  after 
being  calculated  by  back  substitution,  each  y,  is  then  used  to  calculate  yl+i, . . .  ,  yn-\ 
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and  xo,...  ,  x,\  This  flow  of  variables  is  shown  in  figure  17.  Observe  that  though  the 
underlying  grid  is  identical  to  that  in  figure  16,  the  vectors  are  written  in  reverse  order. 

The  LU  architecture  can  be  applied  to  matrix-matrix  multiplication  by  pipelining 
the  input  vectors.  The  only  requirement  is  that  one  of  the  matrices  be  nonsingular. 

As  already  stated,  in  order  to  complete  the  architectures  in  figures  16  and  17, 
some  type  of  computation  is  performed  at  the  arrowheads.  These  locations  are  called 
cells.  The  function  of  these  cells  is  the  choice  of  the  designer  and  could  depend  on 
implementation,  etc.  The  next  two  subsections  each  deal  with  one  type  of  cell. 
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X 


xo  x  1 


Figure  17.  Flow  of  Variables  in  UL  Grid  (n  =  4) 
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5.2.2  The  A  +  BC  Cell 


The  first  cell  is  shown  in  figure  18.  The  operation  of  the  cell  is  a  multiply  and 
accumulate.  There  are  two  inputs,  A  and  B ,  which  are  used  to  produce  the  result 
A  +  BC ,  where  C  is  a  constant  assumed  fixed  for  the  cell,  i.e.,  stored  in  the  cell.  In  the 
algebraic-integer  conversion,  the  calculation  is  carried  out  modulo  some  fixed  prime  p. 


A  B 


A  +  BC 


Figure  18.  The  A  +  BC  Cell 


The  following  observation  from  appendix  A  is  needed:  if  a  matrix  has  LU  decom¬ 
position  LU ,  then  for  any  nonsingular  diagonal  matrix  D ,  ( LD~l)(DU )  is  also  an  LU 
decomposition.  In  this  way,  the  diagonal  of  L  or  U  can  be  prescribed  as  required. 
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5. 2. 2.1  Architecture  for  LU 


The  matrices  of  the  LU  decomposition  are  written  so  that  the  diagonal  of  L  is  all 
Is  (and  so  the  diagonal  of  L~x  is  also  all  Is).  To  ease  the  exposition,  let  n  =  4  and  let 

(  1  0  0  0\ 

*io  1  0  0  | 

*20  *21  1  0  I 

*30  *31  *32  1  / 

and 

/“00  “01  “02  “03  \ 

TJ  __  0  U11  U\2  “13  I 

0  0  1422  “23  I 

\  0  0  0  U33  / 

Since  y L~l  =  x, 

2/3  =  *3 

2/2  =  X2  -  *322/3  9) 

yi  —  X\  —  *312/3  —  *212/2 

2/0  =  Xo  —  *302/3  ~  *202/2  -  *102/1  • 

These  equations  determine  the  constants  for  the  triangle  of  cells  above  the  off  diagonal 
(see  figure  19).  The  remaining  cells  are  filled  with  the  appropriate  u,y;  this  follows  from 
the  equation  z  =  y U .  By  choosing  the  diagonal  of  L  to  be  all  Is,  the  coefficient  of  x, 
is  also  1,  and  so  the  first  row  of  figure  16  can  be  eliminated. 

In  the  example  of  section  5.1.3,  the  matrices  L~x  and  U  are  already  in  the  required 
form.  The  resulting  grid  with  constants  is  shown  in  figure  20. 
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5. 2. 2. 2  Architecture  for  UL 


The  matrices  of  the  LU  decomposition  are  written  so  that  the  diagonal  of  U  is  all 
Is.  Continuing  with  n  =  4,  let 

(too  0  0  0  \ 

iio  hi  0  0  I 

ho  ^21  i-22  0  I 

hd  ^31  h2  hi  J 

and 

(1  1*01  U02  U03  \ 

0  1  ttl2  «13  | 

0  0  1  U23  I  ' 

0  0  0  1  / 


(Of  course,  the  and  utJ  may  be  different  from  those  in  the  previous  section.) 


Since  y U  =  z, 

yo  = 

yi  =zi-  txoiyo 

y2  =  22  -  «02y0  -  «12yi 

V3  =  23  —  U03P0  —  «13yi  -  U23V2  ■ 


(5.10) 


These  equations  determine  the  constants  for  the  triangle  of  cells  above  the  off  diagonal 
(see  figure  21).  Notice  that  the  vectors  are  reversed  from  figure  19.  The  remaining  cells 
are  filled  with  the  appropriate  i{} ;  this  follows  from  the  equation  x  =  y L  .  Again,  the 
first  row  of  cells  in  figure  17  was  eliminated  by  the  choice  of  diagonal  for  U. 

In  the  example  of  section  5.1.3,  the  matrices  L~x  and  U  must  be  changed  so  that 
the  diagonal  of  U  is  all  Is.  The  new  matrices  are 


/  1  0  0  0  \ 

27  7  0  0  | 

14  21  25  0  I 

V 10  29  30  25  / 


and 


Z1 

l 

1 

1  \ 

0 

i 

22 

23 

0 

0 

1 

7 

Vo 

0 

0 

1  / 

The  resulting  grid  is  shown  in  figure  22. 
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5.2.3  The  C(A  -  B)  Cell 


The  A  +  BC  cell  from  above  gives  a  square  grid  for  any  LU  decomposition.  Because 
of  the  analogy  with  ordinary-integer  mixed-radix  conversion,  another  cell  seems  natural, 
namely  one  which  does  a  subtraction  and  then  a  multiplication.  This  is  shown  in 
figure  23.  The  inputs  A  and  B  are  first  subtracted  and  the  result  is  multiplied  by  a 
(stored)  constant  C. 


A  B 


C(A  -  B) 


Figure  23.  The  C(A  -  B)  Cell 


Unfortunately,  this  cell  does  not  lead  to  a  well-defined  architecture  for  general  LU 
decompositions.  Consider  the  equations  in  (5.9),  specifically 

Vl  =  xi  -  f3Iy3  -  i2iy2  . 

In  order  to  put  this  in  the  grid,  there  would  need  to  be  constants  (3}  such  that  the 
following  equation  holds 

y\  =  #j(  #>(  /?l(xi  -  0)  -  y3  )  -  y2  ) .  (5.11) 
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Equating  coefficients  yields  the  equations 


=  1 

0302  = 

03  ~  ?21  • 

In  particular,  02  =  ^31  / ^21  •  But  there  is  no  guarantee  that  £21  is  nonzero.  The  same 
problem  occurs  with  the  equations  for  reconversion  (5.2).  One  can  attempt  to  change 
the  order  of  operations  in  (5.3)  or  to  change  the  diagonal  of  the  respective  matrix, 
but  the  same  criticism  will  still  hold.  In  this  way,  there  is  no  well-defined  method  to 
assign  constants  to  the  cells  for  an  arbitrary  LU  decomposition  (for  either  conversion 
or  reconversion). 

In  the  algebraic-integer  conversion,  there  are  such  constants  0j  for  the  U  matrix 
because  the  i,j  entry 

n<*i  ~ a*) 

*<1 

factors  in  a  natural  way.  This  also  can  be  seen  from  the  analogy  with  ordinary-integer 
mixed-radix  conversion.  The  lower-triangular  matrix  though  does  not  work  in  general. 
It  is  straightforward,  however,  to  combine  the  C{A  -  B)  cells  for  U  with  A  +  BC  cells 
for  L.  The  resulting  grid  for  the  UL  case  is  analogous  to  ordinary-integer  mixed-radix 
conversion.  These  hybrid  grids  though  are  undesirable  because  of  the  lack  in  regularity; 
this  is  especially  so  since  all  additions  and  multiplications  are  modulo  the  same  prime. 
Therefore,  the  A  +  BC  cell  is  the  better  choice. 

The  grids  using  the  C(A  —  B)  cell  are  not  shown;  if  they  exist  they  would  be  similar 
to  figures  19  and  21. 


5.3  SYSTOLIC  CONVERSION 

The  LU  architecture  developed  in  section  5.2  computes  in  parallel  a  modulo  p 
vector-matrix  product,  but  only  if  the  matrix  is  LU-decomposable  (equations  (5.1) 
and  (5.2)).  By  using  Theorem  A. 2  in  appendix  A,  this  architecture  applies  to  an 
arbitrary  nonsingular  matrix  after  an  appropriate  permutation  of  the  coordinates  of 
the  incoming  vector.  It  is  natural  though  to  examine  other  methods  to  compute  vector- 
matrix  products,  and  one  common  approach  is  to  perform  the  computation  via  a  systolic 
architecture.  A  complete  treatment  of  systolic  architectures  is  beyond  the  scope  of 
this  paper.  Rather  the  LU  architecture  is  modified  into  a  systolic  architecture.  The 
architectures  are  linked  by  the  common  use  of  the  .4  -f  BC  ceils. 
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5.3.1  The  Systolic  Architecture 


A  systolic  architecture  is  characterized  by  computation  taking  place  in  a  set  of 
locally  interconnected  simple  cells.  It  is  desirable  for  the  computation  to  be  pipelined 
so  that  the  cells  are  in  constant  use.  The  LU  architecture  is  semi-systolic;  it  is  fully 
pipelineable  and  the  operations  are  performed  in  simple  cells.  Information,  however, 
must  be  made  available  globally  via  long  buses.  The  goal  then  is  to  eliminate  these 
long  buses.  For  a  general  introduction  to  systolic  architectures,  see  [11]  . 

It  should  be  stressed  that  for  certain  applications  there  are  more  suitable  architec¬ 
tures  than  those  discussed  here.  For  example,  if  the  n  inner  channels  were  implemented 
with  a  single  channel  running  n  times  as  feist,  the  n  coordinates  would  be  available 
sequentially.  In  this  case,  a  linear  systolic  array,  requiring  only  n  as  opposed  to  n~  cells, 
may  be  more  appropriate. 

If  the  inputs  and  outputs  are  to  be  processed  in  parallel,  then  systolic  arrays 
to  perform  matrix-matrix  multiplication  are  required.  There  are  a  variety  of  systolic 
architectures  that  perform  matrix-matrix  multiplication  involving  a  host  of  basic  cells 
and  data-flow  strategies  ([17]  ).  When  the  restriction  is  made  to  use  the  A  -f  BC 
cell  described  in  section  5.2  (figure  18)  and  a  rectangular  data  flow,  then  the  systolic 
architecture  of  this  section  is  suggested. 

The  systolic  architecture  is  described  for  n  =  4,  and  is  easily  applied  for  general  n. 
Suppose  that  the  goal  is  to  compute  the  vector-matrix  product  xV\  i.e., 


(xq,Xi,X2,X3) 


Voo 

voi 

V02 

^03  \ 

VlO 

vu 

V\2 

t>13  ' 

V20 

V21 

V22 

V23  I 

V30 

V31 

^32 

V33  / 

The  flow  of  data  is  shown  in  figure  24.  Each  matrix  element  resides  in  a  basic  cell,  so 
there  are  n2  cells.  The  inputs  enter  from  the  left  and  are  staggered.  They  are  passed 
unchanged  along  the  horizontal  arrows.  Each  input  is  multiplied  by  the  matrix  element 
and  the  result  accumulated  and  passed  down  to  the  next  cell  along  the  vertical  arrows. 
For  example,  at  step  1,  xouoo  is  computed.  At  step  2,  xovoi  and  xiiqo  are  computed, 
with  the  latter  then  added  to  the  previously  computed  xofoo-  After  step  3,  the  partial 
answers  are  xot>o2>  xovoi  +  zivu,  and  xovoo  +  xitqo  4-  £2V20-  The  outputs  thus  come 
out  staggered  at  the  bottom  at  steps  4,  5,  6,  and  7. 

This  process  is  further  illustrated  in  figure  25.  Each  cell  is  an  A  4-  BC  cell  of 
figure  19.  The  result  is  fully  pipelineable,  as  is  illustrated  by  the  vector  y  following 
the  vector  x.  Once  the  pipe  is  filled,  a  set  of  four  coordinates,  corresponding  to  four 
different  output  vectors,  is  available  at  each  step. 
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Figure  24.  Data  Flow  in  the  Systolic  Architecture 
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5.3.2  Comparison  with  LU  Architecture 

Both  the  LU  architecture  and  the  systolic  architecture  are  fully  pipelineable.  Both 
can  be  used  to  compute  in  parallel  a  modulo  p  vector-matrix  multiplication  where  the 
matrix  is  arbitrary  (for  the  systolic  architecture),  or  arbitrary  nonsingular  (for  the  LU 
architecture,  after  possibly  rearranging  inputs).  Both  architectures  are  easily  parti¬ 
tioned  into  smaller  blocks,  say  by  rows  or  2  x  2  subgrids.  For  the  systolic  architecture, 
these  blocks  are  identical;  for  the  LU  architecture,  a  slight  modification  must  be  made 
depending  on  whether  certain  lines  are  input  or  output  lines. 

The  major  difference  between  the  two  architectures  is  the  timing  of  the  inputs  and 
outputs.  The  systolic  architecture  requires  the  inputs  and  outputs  to  be  staggered,  and 
thus  after  the  pipe  is  full  the  n  coordinates  entering  and  exiting  the  converter  at  each 
time  step  correspond  to  n  consecutive  algebraic- integer  results.  Therefore,  the  systolic 
architecture  requires  more  complicated  synchronization  before  and  after  the  converter. 
On  the  other  hand,  the  n  coordinates  entering  and  leaving  the  converter  for  the  LU 
architecture  correspond  to  a  single  algebraic-integer  result.  However,  long  buses  are 
needed  at  each  time  step.  This  may  be  a  concern  for  larger  matrices. 


5.4  VLSI  IMPLEMENTATION 

Both  the  LU  architecture  of  section  5.2  and  the  systolic  architecture  of  section  5.3 
are  highly  regular,  and  thus  well  suited  to  VLSI  implementation.  In  such  an  implemen¬ 
tation,  the  architecture  would  be  divided  up  evenly  into  chips,  with  as  much  of  the  grid 
of  the  architecture  put  onto  a  single  chip  as  feasible. 

Although  a  single  chip  implementing  both  architectures  is  possible,  since  both 
use  the  same  basic  A  +  BC  cell,  the  chip  to  be  fabricated  will  implement  the  LU 
architecture.  This  architecture  was  chosen  over  the  systolic  architecture  because  of 
I/O  considerations,  namely  the  fact  that  the  algebraic-integer  vectors  do  not  need  to 
be  staggered.  Furthermore,  the  long  buses  required  are  not  a  problem  for  the  degrees 
being  considered  (up  to  degree  8).  The  systolic  architecture  is  not  :mplemented  along 
with  the  LU  architecture  because  of  the  added  complexity  in  the  design  due  to  the 
extra  delays  needed.  The  I/O  assignments  on  the  chip  are  not  ideally  suited  to  both 
architectures.  It  should  not  be  difficult,  however,  in  adapting  the  actual  VLSI  layout 
to  a  systolic  approach  if  such  is  desired  in  the  future. 

Recall  that  the  grid  for  the  LU  architecture  consists  of  an  n  x  n  square  of  A  +  BC 
cells.  Furthermore,  for  the  applications  of  algebraic  integers  usually  considered,  n  is 
even,  and  this  will  be  the  assumption  here. 
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5.4.1  Chip  Layout  Limitation 


The  chip  to  be  fabricated  uses  an  84-pin  package  and  a  7.9-mm  x  9.2-mm  die 
size.  These  parameters  were  chosen  based  on  two  main  limitations  derived  from  the 
architecture. 

The  first  major  limitation  to  the  VLSI  implementation  of  the  LU  architecture  is 
modulus  programmability.  Specifying  a  prime  or  family  of  primes  beforehand  limits  the 
possible  sets  of  algebraic  integers.  Instead  ‘he  prime  needed  in  the  conversion  should 
be  an  input  (though  fixed  during  a  given  application).  This  ability  to  program  the 
prime  requires  a  larger  area  to  perform  the  modulo  arithmetic.  This  in  turn  places 
limits  on  the  VLSI  chips,  since  a  chip  consists  mainly  of  A  +  BC  cells,  each  of  which 
consists  of  a  modulo  adder  and  multiplier. 

The  other  major  limitation  to  the  VLSI  implementation  is  the  number  of  input 
and  output  lines,  under  the  assumption  that  the  values  in  the  architectures  are  to  be 
bused  in  parallel.  Area  estimates  for  the  A  +  BC  cells  suggested  that  four  of  these  cells 
would  fit  on  a  chip.  There  are  three  natural  ways  to  place  four  cells  in  an  n  x  n  grid:  in 
a  row,  in  a  column,  and  as  a  2  x  2  sub-grid  (recall  figure  19).  The  I/O  requirements  for 
just  the  cells,  i.e.,  ignoring  for  the  moment  the  prime,  the  cell  constants,  etc.,  are  9  lines 
for  the  row,  7  lines  for  the  column,  and  6  lines  for  the  2x2  sub-grid.  Here  the  lines 
are  6-bits  wide,  with  6  to  be  determined.  Notice  further  that  the  first  two  placements 
require  that  n  be  divisible  by  4.  Also,  the  2x2  placement  corresponds  itself  to  the 
grid  for  a  degree  2  extension  (such  as  QR.NS).  Hence,  placement  in  a  2  x  2  sub-grid  is 
suggested. 

The  prime  and  constants  could  be  read  in  by  a  single  1-bit  line.  However,  allowing 
wider  (6-bit)  lines  would  increase  the  generality  of  the  chip,  letting,  for  example,  the 
matrix  be  changed  during  processing  (though  this  may  reduce  the  speed  of  the  chip). 
Allowing  one  6-bit  line  for  the  prime  and  two  other  6-bit  lines  for  the  cell  constants 
would  bring  the  total  to  9  lines  for  the  2x2  sub-grid.  For  an  84-pin  package,  the 
implied  upper  limit  on  6  is  8.  Setting  6  =  8,  there  would  then  be  12  remaining  pins  to 
allow  for  the  control  lines,  the  clocks,  and  so  forth. 


5.4.2  Algorithms  for  the  A  +  BC  Cell 

The  computation  in  an  A  +  BC  cell  consists  of  a  multiplication  and  an  accumulation 
modulo  an  8-bit  modulus.  The  accumulation  can  be  absorbed  in  the  multiplication,  so 
that  the  two  major  operations  performed  in  the  cell  are  the  computation  of  A  +  BC 
and  the  reduction  of  the  result  modulo  the  modulus.  (It  should  be  noted  that  nothing 
inherent  in  the  design  requires  that  the  modulus  be  a  prime.) 


The  multiplication  uses  a  standard  array  multiplier,  see,  for  example,  [»]  .  Such  a 
multiplier  consists  of  an  8  x  8  square  grid  of  small  cells,  each  of  which  is  made  up  of 
a  full  adder  and  an  and  gate.  By  setting  the  initial  sums  to  the  bits  of  A,  rather  than 
zero,  the  multiplier  computes  A  +  BC ,  rather  than  just  BC .  This  result  is  16  bits  long, 
assuming  that  all  of  the  input  values  are  8  bits. 

The  multiplication/accumulation  is  followed  by  a  modulo  reduction,  which  is  es¬ 
sentially  a  division.  The  16-bit  number  N  is  divided  by  the  8-bit  modulus  M  to  produce 
a  quotient  Q,  which  could  be  9  bits,  and  an  8-bit  remainder  R.  At  each  successive  stage, 
N  is  updated  based  on  a  comparison  with  a  shift  of  M.  If,  for  the  current  value  of  N, 
N  >  2‘  M,  then  the  ith  bit  of  Q  is  1  and  N  is  decremented  by  2*  M;  otherwise  N  is 
unchanged  and  the  ith  bit  of  Q  is  0.  Here  i  runs  from  8  down  to  0.  After  the  last  loop, 
the  value  of  N  is  precisely  R.  In  VLSI,  the  inequality  N  >  2'  M  \s  checked  by  adding 
216  —  2‘  M  to  N  and  checking  for  a  carry.  It  is  then  straightforward  to  keep  the  old 
value  of  N  or  the  new  sum,  which  is  N  decremented  by  2*  M,  based  on  the  value  of 
the  carry.  The  carry  is,  in  fact,  the  ith  bit  of  Q.  This  algorithm  is  demonstrated  in 
figure  26. 

In  practice,  only  8-bit  adders,  not  16-bit  adders,  are  needed  to  perform  this  addi¬ 
tion.  This  is  because  216  —  2'  M  =  2‘(216_‘  —  M)  and  216-'  —  M  is  just  28  —  M  with 
8  —  i  leading  ones.  Hence,  the  8-bit  number  28  —  M  is  added  to  the  appropriate  8  bits 
of  N.  The  carry  of  the  16-bit  addition  is  just  the  next  higher  bit  of  N  or-ed  with  the 
carry  of  this  8-bit  addition.  All  of  the  more  significant  bits  of  N  are  necessarily  0,  and 
the  bits  of  N  below  the  8  bits  used  in  the  addition  are  not,  needed  (and  are  just  passed 
down).  The  example  of  figure  26  is  shown  in  figure  27  using  these  observations.  Here 
the  8-bit  addition  is  shown  in  the  box.  The  next  higher  bit  of  N  is  shown  to  the  left, 
and  the  unused  lower  bits  of  N  to  the  right. 
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N  =  35,161 


M  =  199 


Step  1 

Current  N 

2' 6  -  2a  M 

1000  1001  0101  1001 
0011  1001  0000  0000 

Carry  =  0 

Step  2 

Current  N 

2' 6  -  27  M 

1000  1001  0101  1001 
1001  1100  1000  0000 

Carry  =  1 

Step  3 

Current  N 

2’ 6  -  26  M 

0010  0101  1101  1001 
1100  1110  0100  0000 

Carry  =  0 

Step  4 

Current  N 

2' 6  -  2s  M 

0010  0101  1101  1001 
1110  0111  0010  0000 

Carry  =  1 

Step  5 

Current  N 

2' 6  -  24  M 

0000  1100  1111  1001 
1111  0011  1001  0000 

Carry  =  1 

Step  6 

Current  N 

2'6  -  23  M 

0000  0000  1000  1001 
1111  1001  1100  1000 

Carry  =  0 

Step  7 

Current  N 

2' 6  -  22  M 

0000  0000  1000  1001 
1111  1100  1110  0100 

Carry  =  0 

Step  8 

Current  N 

2' 6  -  2’  M 

0000  0000  1000  1001 
1111  1110  0111  0010 

Carry  =  0 

Step  9 

Current  N 

2' 6  -  2°  M 

0000  0000  1000  1001 
1111  1111  0011  1001 

Carry  =  0 

R  =  0000  0000  1000  1001  =  137 
Q  =  0  1011  0000  =  176 

35.161  *  176  x  199  +  137 


Figure  26.  A  Modulo  Reduction  Example  (16-bit  Adders) 
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N  =  35,161 


M  =  199 


Step  1 
Step  2 
Step  3 
Step  4 
Step  5 

Step  6 

Step  7 

Step  8 

Step  9 


10001001 

00111001 


0  1  0 


1  00010010 
00111001 


1  0 


0  0  1 


0  0  1 


0 


10  0  10  111 
00111001 


0 


0  0  1 


1  00101110  1 
00110011 


0 


11001111 

00110011 


0  0  1 


0  0  1 


0 


00010001 

0  0  1  ‘.0011 


0  0  1 


0 


00100010 

00110011 


0  1 


0 


01000100 
0  0  1  1  0  0  1  1 


1 


0 


10001001 

00110011 


Result  of  OR 

Result  of  OR 

Result  of  OR 

Result  of  OR 

Result  of  OR 

Result  of  OR 

Result  of  OR 

Result  of  OR 

Result  of  OR 


R  =  1000  1001  =  137 
Q  =  0  1011  0000  =  176 

35,161  =  176  x  199  +  137 


0 

1 

0 

1 

1 

0 

0 

0 

0 


Figure  27.  A  Modulo  Reduction  Example  (8-bit  Adders) 
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5.4.3  Layout  and  Testing 


This  section  contains  some  pictures  of  the  chip.  The  chip  was  designed  using 
scalable  CMOS  technology.  Figure  28  shows  the  multiplier/accumulator.  The  8  bits 
of  C  come  from  the  right;  the  8  bits  of  A  and  B  come  in  from  the  bottom.  The  16 
bits  of  A  +  BC  come  out  from  the  left  and  top.  Figure  29  shows  the  modulo  reduction. 
The  16  bits  of  the  input  come  in  from  the  top  and  right;  the  remainder  exits  from  the 
bottom.  There  are  9  rows,  each  an  8-bit  adder.  After  every  three  rows,  there  is  a  set 
of  registers  for  pipelining  (the  last  set  is  not  shown,  since  it  is  outside  this  cell).  The 
registers  allow  for  pipelining;  each  section  isolated  by  registers  takes  roughly  the  same 
time  as  the  multiplier.  Figure  30  shows  the  complete  A  +  BC  cell,  with  the  modulo 
reduction  on  the  left  and  the  multiplier  on  the  right.  There  is  a  set  of  registers  between 
the  two.  Finally,  figure  31  shows  a  picture  of  the  whole  chip  with  four  cells  forming  a 
2x2  grid. 

Currently,  the  chip  is  undergoing  testing  before  fabrication.  SPICE  simulations 
for  the  basic  full  adder  used  in  both  the  multiplier  and  the  modulo  reducer  indicate  a 
speed  of  roughly  3  MHz.  The  chip  will  be  fabricated  using  3  pm  technology. 


5.5  CONCLUSION 

The  conversion  into  and  out  of  the  inner  level  of  parallelism  of  an  algebraic-integer 
implementation  is  a  modulo  p  vector-matrix  multiplication.  When  the  algebraic  inte¬ 
gers  are  in  a  natural  representation  (eq.  5.1),  the  conversion  matrix  is  a  Vandermonde 
matrix.  Some  facts  were  derived  about  the  Vandermonde  matrix,  including  its  LU  fac¬ 
torization.  This  latter  result  was  used  in  the  illustration  of  an  architecture  to  perform 
the  conversion. 

This  architecture,  called  the  LU  architecture,  uses  the  LU  factorization  of  a  matrix 
to  perform  vector-matrix  multiplication.  With  some  slight  modification,  it  can  be  used 
with  any  nonsingular  matrix.  The  LU  architecture  is  semisystolic,  in  that  computations 
are  performed  in  a  grid  of  small  basic  cells.  Unfortunately,  long  buses  are  needed 
to  distribute  information  globally.  The  LU  architecture  was  compared  to  a  systolic 
architecture  using  the  same  basic  cells.  Although  with  the  systolic  architecture  all 
information  is  just  distributed  locally,  the  input  and  output  vectors  need  to  be  staggered. 
This  is  not  the  case  with  the  LU  architecture,  which  requires  these  vectors  to  be  available 
all  at  once.  Because  of  this  feature,  the  LU  architecture  was  chosen  for  implementation 
in  VLSI. 

A  degree  n  conversion  using  the  LU  architecture  requires  an  n  x  n  grid  of  basic 
cells.  For  the  VLSI  implementation,  this  grid  is  blocked  off  into  n/ 2  x  n/2  grid  of  chips, 
each  chip  containing  a  2  x  2  grid  of  basic  cells  (n  is  assumed  even).  The  chip  is  limited 
to  an  8-bit  modulus,  so  that  all  values  are  also  limited  to  8  bits.  The  matrix  can  either 
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be  fixed  or  changed  during  computation,  although  the  latter  may  reduce  speed.  The 
chip  has  been  designed  using  scalable  CMOS  technology.  SPICE  simulations  indicate 
a  speed  of  3  MHz.  The  chip  is  currently  undergoing  testing  before  fabrication  using 
3  fim  technology. 
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Figure  28.  The  Multiplier/ Accumulator 
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Figure  29.  The  Modulo  Reduction 
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Figure  30.  The  A  +  BC  Cell 
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Figure  31.  The  LU  Architecture  Chip 


SECTION  6 


ALGEBRAIC-INTEGER-TO- ANALOG 
(OR  DIGITAL)  CONVERSION 


After  processing  in  an  algebraic-integer  RNS  system  is  completed,  the  results  are 
converted  from  the  inner  and  outer  level  of  RNS  parallelism  back  into  algebraic- integer 
form.  The  algebraic  integers  can  be  assumed  to  be  of  the  form 

a.QU}Q a\u\  an—\ujn—\,  (6.1) 

where  the  coefficients  a;  are  integers,  which  are  usually  much  larger  than  at  the  in¬ 
put  due  to  growth  during  the  processing,  and  {uo,  uq,  ...,  u;n-i}  corresponds  to  the 
algebraic- integer  basis.  The  final  quantization  process,  referred  to  as  requantization  to 
distinguish  it  from  the  input  quantization  problem,  involves  evaluating  (6.1)  using  finite 
precision  approximations  of  the  a>is. 

This  final  evaluation  can  be  real  or  complex  depending  on  the  choice  of  algebraic 
integers.  If  the  uqs  are  represented  in  binary  form,  then  a  binary  result  is  computed. 
If  desired,  this  binary  result  can  be  input  to  a  conventional  digital-to-analog  (D/A) 
converter  to  obtain  an  analog  result. 

The  key  issue  in  this  section  is  determining  how  accurate  the  approximations  of  the 
uis  have  to  be  so  that  requantization  degrades  only  negligibly  the  overall  performance. 
That  the  coefficients  in  (6.1)  can  become  quite  large  would  seem  to  imply  that  the 
uJiS  may  have  to  be  approximated  very  accurately.  However,  this  has  turned  out  to 
not  always  be  the  case.  In  the  case  of  computing  a  feist  Fourier  transform  (FFT)  with 
cyclotomic  integers,  [4]  reported  that  extensive  simulations  showed  that  approximations 
(in  this  case  of  powers  of  u — a  complex  16th  root  of  unity)  having  comparable  precision 
to  those  employed  in  the  input  quantization  and  for  the  twiddle  factors  were  generally 
sufficient,  with  very  little  gain  in  performance  if  more  accurate  approximations  were 
used.  The  paper  [4]  also  provided  a  statistical  analysis  as  to  why  this  was  so. 

This  section  deals  with  issues  common  to  general  algebraic-integer  requantization, 
using  the  algebraic  integers  Z[\/2  +  \/2  ]  for  illustration.  Performance  data,  which  is 
consistent  with  the  conclusion  in  [4]  ,  is  presented  for  three  FIR  filter  examples.  An 
error  analysis  shows,  for  a  processing  function  consisting  of  a  sum  of  products  (which 
are  assumed  to  be  orthogonal  random  variables),  that  the  performance  degradation,  ex¬ 
pressed  as  a  difference  of  output  SNRs,  is  independent  of  the  output  coefficient  size  and 
summation  length.  The  requantization  for  complex  algebraic  integers  can  be  reduced 
to  the  real  case  by  treating  real  and  imaginary  parts  separately.  This  is  illustrated  for 
2[e-*-i/i6j  usjng  2[y/2  +  \/2],  Finally,  implementation  issues  are  discussed,  including 
the  possibility  of  implementing  the  requantization  using  an  RNS. 
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6.1  ALGEBRAIC-INTEGER  REQUANTIZATION  PERFORMANCE 

The  final  stage  in  any  algebraic-integer  RNS  system  is  the  evaluation  of 
(6.1)  using  finite  precision  approximations  of  elements  of  the  algebraic-integer 

basis.  The  ring  Z[\/2  +  \/2]  represented  with  the  nonpolynomial  basis  B  = 
{1,  v2  +  \/2,  y/2,  yj 2  —  \J2  }  will  be  used  to  illustrate  the  issues  involved.  The  num¬ 
bers  in  this  case  have  the  form 

cl  =  o,q  +  a\  \/2  +  y/2  -f  +  ai\pT—  \/2.  (6-2) 

As  before,  Z[B]jv/  denotes  the  set  of  elements  of  the  form  (6.2)  with  coefficients  in  the 
range  [— M/2,  M/2]. 

To  form  the  output  a  in  this  case,  an  inner  product  of  the  algebraic-integer  coeffi¬ 
cient  vector  A  =  (ao,  ai,  02, 03)  must  be  taken  with  the  basis  vector  ft, 

ft  =  (l,  s/2  +  y/2,  y/2,  yj2-y/2)  . 

This  task  is  complicated  by  the  fact  that  three  of  the  components  of  the  basis  vector 
cannot  be  represented  exactly  as  binary  fractions.  The  obvious  solution — use  as  many 
bits  as  may  be  required  for  the  basis  vector  components — may  not  be  feasible,  as  more 
bits  translates  into  slower  evaluations.  This  could  make  this  final  evaluation  the  system 
bottleneck. 

A  practical  requirement  is  that  the  components  of  ft  be  represented  by  numbers  of 
b  bits  or  less,  where  b  is  determined  by  the  speed  requirements  of  the  system.  A  degree 
of  freedom  still  exists  even  when  b  has  been  specified,  due  to  the  fact  that  a  scaled 
version  of  the  output  a  is  just  as  good  as  a  itself  as  long  as  the  scale  factor  remains 
constant.  Thus,  one  can  choose  to  represent  A:ft,  k  a  constant  scale  factor,  rather  than 
ft  itself  if  this  yields  a  better  approximation.  In  either  case,  the  6-bit  approximation  of 
the  basis  vector  is  ft, 

kui  =  min  |[fa*/,-],  2b  -  1  j  ,  t  =  0,  1,  2,  3,  (6.3) 

where  u>,  is  the  ith  component  of  ft,  and  the  square  brackets  indicate  a  rounding  to 
the  nearest  integer  (k  is  considered  to  be  1  for  the  non-scaled  approximation).  The 
advantages  of  scaling  can  be  seen  when  approximating  the  numbers  (\/2,  \/8).  Using 
the  scale  factor  k  =  l/\/2  results  in  the  numbers  (1,  2),  which  can  be  represented 
exactly  with  only  two  bits. 

How  does  one  choose  the  proper  scale  factor  k  given  the  basis  vector  and  the 
number  of  bits  used  in  approximating  it?  Assume  that  the  rescaled  (the  factor  of  k  has 
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been  removed)  exact  output  is  a ,  and  the  rescaled  output  of  a  finite-precision  evaluation 
is  a.  Then  the  quantity  to  be  minimized  is  the  squared  error  (a  — a)2.  Letting  H  =  Q,  —  Q, 
the  error  vector  due  to  the  approximation  of  Cl,  then 

(a  -  a)2  =  (ao7?o  4-  a\T)\  +  aim  +  mm)2 ,  (6-4) 

where  7/,  is  the  ith  component  of  H.  On  the  assumption  that  the  coefficients  a,  are  in¬ 
dependent,  zero-mean,  and  identically  distributed  random  variables,  then  the  expected 
error  due  to  the  basis  vector  approximation  error  is 

E((a  -  a)2)  =  E(al)T)l  +  E{a\)r{[  +  E{a\)r)l  +  £(a2)r/f 

=  E{al){r)2  +  7?2  +  +  *73  )•  (6-5) 

That  this  error  depends  on  the  expected  value  of  the  square  of  the  output  coefficient 
size  means  that  the  requantization  error  is  dependent  on  the  processing  function. 

The  quantity  in  (6.5)  is  minimized  by  minimizing  the  sum  of  the  squares  of  the 
basis  vector  approximation  errors  rjj.  Using  (6.3),  (6.5),  and  the  equation  for  H ,  it  can 
be  shown  that  that  sum  is  a  piecewise-continuous  quadratic  function  of  k  containing  a 
finite  number  of  minima.  These  can  be  solved  for  exactly  and  compared  to  find  the  scale 
factor  that  minimizes  the  squared  approximation  error.  Table  9  gives  optimal  values  of 
k  for  b  ranging  from  8  to  12  bits.  In  addition,  the  integer  approximation  kCl  (which  is 
what  the  requantization  hardware  would  actually  use)  and  the  rescaled  approximation 
Cl  are  given  for  the  various  values  of  b.  The  first  four  entries  are  the  exact  values  of  the 
components  of  Cl  to  six  decimal  places  and  are  there  for  comparison. 

To  test  the  performance  of  finite-precision  requantization,  a  simulation  of  an 
algebraic- integer  FIR  filter  was  designed.  Three  filters — a  lowpass  filter,  a  highpass 
filter,  and  a  multiple  passband/stopband  filter — were  simulated  on  nine  sets  of  test 
input  data.  The  test  data  consisted  of  a  combination  of  sinusoids  and  additive  white 
Gaussian  noise.  For  each  run,  an  algebraic-integer  range  M  was  chosen.  The  set  Z[B]a/ 
was  then  used  to  quantize  the  input  data  by  going  through  a  12-bit  A/D  converter  and 
mapping  to  the  appropriate  element  in  Z[B]a/  (the  two-stage  approach  of  section  3.2). 
The  quantized  data  was  put  through  a  filter  arrived  at  by  quantizing  one  of  the  three 
filters — initially  designed  with  floating-point  coefficients — with  the  closest  scaled  ele¬ 
ment  of  Z[B]a/.  Scaling  here  is  used  in  the  same  manner  and  for  the  same  reasons  as 
it  was  in  approximating  Cl,  only  now  the  set  of  available  representation  levels  consists 
of  algebraic  integers  rather  than  just  integers  as  before  (simulation  design  issues  are 
covered  in  greater  detail  in  the  next  section). 

To  measure  the  effect  on  performance  of  the  finite-precision  final  evaluation,  tiie 
performance  when  this  evaluation  is  performed  exactly  must  first  be  measured.  This 
was  done  by  comparing  the  output  of  the  algebraic- integer  simulation,  assuming  exact 
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Table  9.  Optimal  Approximations  for  6-bit  Requantization 

for  Z[\/2  +  \/2] 


bits 

k 

ktl 

h 

oo 

— 

— 

1.000000 

1.847759 

1.414214 

0.765367 

8 

110.983 

Ill 

1.000157 

205 

1.847136 

157 

1.414636 

85 

0.765886 

9 

255.992 

256 

1.000029 

473 

1.847711 

362 

1.414104 

196 

0.765648 

10 

473.016 

473 

0.999966 

874 

1.847717 

669 

1.414328 

362 

0.765302 

11 

1091.042 

1091 

0.999961 

2016 

1.847775 

1543 

1.414244 

835 

0.765323 

12 

1759.977 

1760 

1.000013 

3252 

1.847751 

2489 

1.414223 

1347 

0.765351 

requantization,  with  the  output  of  a  floating-point  simulation  (i.e..  a  simulation  where 
neither  the  floating-point  input  data  nor  the  original  floating-point  filter  coefficients 
were  quantized,  and  where  all  computations  took  place  in  full  floating-point  accuracy). 
The  performance  of  the  algebraic-integer  simulation  with  exact  requantization  was  mea¬ 
sured  by  computing  the  signal-to-noise  ratio  (SNR)  of  that  simulation  with  respect  to 
the  floating-point  simulation. 
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Table  10.  SNR  Degradation  for  6-bit  Polynomial  Evaluation 


ASNR  as  a  function  of  the  range  M 

bits 

2 

4 

6 

8 

10 

12 

8 

0.01 

MEM 

0.84 

3.63 

7.09 

10.41 

9 

0.01 

n 

0.11 

1.04 

1.43 

3.49 

10 

0.00 

mm 

-0.02 

0.09 

0.24 

1.91 

11 

0.00 

-0.01 

0.01 

-0.01 

-0.03 

0.55 

12 

0.00 

0.00 

0.01 

0.00 

0.02 

0.26 

To  be  more  precise,  let  the  jth  output  of  the  floating-point  simulation  for  a  specific 
set  of  input  data  and  a  specific  filter  be  called  ay,  and  let  the  jth  output  of  the  analogous 
algebraic-integer  exact  requantization  simulation  be  called  ay.  Then  the  SNR  with  exact 
requantization  is 


SNRi  =  lOlogjo 


y  a2 


Ey(ai  -  aj)2 


(6.6) 


Call  the  jth  output  of  the  finite-precision  polynomial  evaluation  simulation  ay.  Then 
the  SNR  in  that  case  is 


SNR?  =  101og10 


Ey(«i  “ 


(6.7) 


The  degradation  due  to  using  H  to  generate  ay  rather  than  using  12  to  generate  ay,  is 
the  difference  between  the  two  SNRs,  i.e., 

ASNR  =  SNRi  -  SNR?.  (6.8) 


The  above  quantity  has  been  computed  for  two  different  sets  of  input  data  put 
through  all  three  filters — in  other  words,  six  combinations.  Each  of  these  combinations 
was  run  using  M  —  2,  4,  6,  8,  10,  12  and  requantization  precisions  of  8  to  12  bits,  as 
well  as  an  exact  floating-point  requantization.  In  addition,  a  floating-point  simulation 
of  each  combination  was  run  to  set  up  a  signal  reference  against  which  SNRs  could  be 
measured.  The  results  were  divided  according  to  M  and  6,  and  are  tabulated  in  table 
10. 

The  advantages  of  scaling  before  approximation,  as  done  in  (6.3),  can  be  seen  in 
the  fact  that  an  SNR  degradation  of  more  than  13  dB  was  observed  when  scaling  was 
not  used  in  the  6  =  8  bits,  M  =  8  case  (as  compared  to  only  a  3.63-dB  degradation 
for  an  optimal  scaled  approximation).  In  that  instance,  the  w,s  were  approximated  by 
writing  them  as  binary  fractions  and  rounding  to  8  bits. 
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It  seems  clear  from  the  table  that  degradation  in  SNR  for  a  given  precision  6  is 
directly  linked  to  the  size  of  M,  which  in  turn  is  related  to  the  accuracy  of  the  input  and 
filter  coefficient  approximations.  If  the  limit  on  acceptable  degradation  is  set  at  0.5  dB, 
then  a  requantization  precision  set  at  the  same  level  as  the  input  and  filter  coefficient 
precision  is  all  that  is  required  to  obtain  acceptable  performance.  For  example,  if 
M  =  4,  which  corresponds  to  better  than  8-bit  performance  (see  section  3.4),  6  =  8 
suffices. 

The  reader  may  wonder  why  some  (albeit  small)  negative  SNR  degradations  occur 
in  table  10,  indicating  that  the  imprecise  requantization  improved  performance,  rather 
than  degrading  it  as  expected.  The  reason  for  this  lies  in  the  fact  that  the  imprecise 
evaluation  merely  adds  more  errors  onto  an  already  corrupted  result.  On  average,  one 
expects  the  error  to  increase  as  a  result,  but  there  is  no  reason  why  the  two  errors  being 
added  might  not  tend  to  cancel  each  other  in  specific  cases.  Since  only  six  runs  are 
being  averaged  to  produce  each  entry  in  table  10,  it  is  not  surprising  that  a  noticeable 
variation  from  the  expected  degradation  occurs.  When  this  expected  degradation  is 
very  close  to  zero,  the  variation  can  result  in  negative  ASNRs.  One  would  expect  this 
phenomenon  to  disappear  for  very  large  sample  sizes. 


6.2  AN  ERROR  ANALYSIS 

The  results  in  table  10  apply  only  to  the  finite-impulse  response  filtering  simula¬ 
tions  that  were  run.  A  more  valuable  result  would  be  an  error  analysis  of  requantization 
in  a  more  general  processing  framework. 

System  performance  for  finite-impulse  response  filtering  is  usually  measured  by 
SNR.  That  is  why  SNR  degradation  was  considered  to  be  an  appropriate  measure  of 
finite  precision  requantization  in  section  6.1.  In  a  more  general  setting  SNR  may  be 
an  inconvenient  way  of  quantifying  errors.  Even  worse,  it  may  not  even  be  a  relevant 
measure  at  all.  The  ultimate  error  measure  is.  of  course,  the  errors  themselves,  but 
these  are  usually  cumbersome  to  work  with  directly.  If  one  assumes  the  errors  due  to 
requantization  are  a  sequence  of  zero-mean  uncorrelated  random  variables  (a  common 
assumption)  then  the  most  common  measure  of  the  error  sequence  is  the  expectation 
of  the  squared  error.  Call  this  cr,, 


0%  =  E  ((a  —  a)2)  .  (6.9) 

where  E  indicates  an  expectation  (the  time  index  j  is  omitted  in  this  section  under  the 
assumption  that  all  sequences  are  wide-sense  stationary).  By  (6.5),  <r~  is  proportional 
to  the  expected  square  of  the  algebraic  integer  coefficient  ao.  Under  the  assumptions 
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used  in  (6.5),  it  can  be  shown  that 


E(a~ )  =  E((ao  -f  ai  \J2  +  \/2  4-  ct2\/2  +  a$\/2  —  \/2  )2) 

=  (1  +  (2  +  y/2)  +  2  + (2-  y/2))E(al) 

=  7  E(4).  (6.10) 

Assuming  a  sum-of-products  process,  each  ideal  output  term  a  is  a  sum  of  N  product 
terms  p<, 

N 

i=l 


and  each  pi  is  actually  approximated  (because  of  initial  quantizations)  by  p,  with  an 
error  e,  =  p,  —  p,,  so  that 

JV 

a  =  1L  Pi‘ 

<=1 

Assuming  the  p,  are  identically  distributed  orthogonal  random  variables, 


Using  (6.5),  (6.10),  and  (6.11)  in  (6.9)  yields 


a 


2 

r 


A rE{pp 

7 


{Vo  +  Pi  +  p]  +  vl)- 


(6.11) 


(6.12) 


It  can  be  seen  that  the  requantization  error  grows  linearly  as  the  number  of  product 
terms  N. 

Even  though  the  main  focus  of  section  6  is  the  effect  of  requantization,  one  might 
also  be  interested  in  errors  due  to  initial  quantizations,  a*  =  E((a  —  a)').  It  is  not 
difficult  to  show  that 

a]  =  NE  (e“)  ,  (6.13) 

if  one  assumes  that  the  are  identically  distributed  zero-mean  uncorrelated  random 
variables.  Given  <7%,  a",  and  the  expected  signal  power  E{or)  =  NE{p\),  one  can 
construct  almost  all  conceivable  interesting  measures  of  error  (i.e.,  total  error  variance 
cr~  +  of  i  relative  error  variances,  etc.).  Since  E(p\ )  =  E{p\)  +  E(e\),  and  since  one  has 
explicit  control  over  N  and  the  basis  vector  approximation  error,  the  three  quantities 
specified  in  (6.11 )— (6. 13)  depend  only  on  E{p\)  and  E(e\).  These  latter  two  quantities 
are  specific  to  the  processing  situatiou,  but  once  they  have  been  determined  the  error 
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performance  of  the  system  can  be  completely  characterized  for  all  values  of  N  and 
all  basis  vector  approximations.  Hence  the  goal  of  this  section — to  determine  how 
the  accuracy  of  the  basis  vector  approximation  effects  the  overall  performance — can  be 
achieved  using  (6.11)-(6.13)  without  need  for  experimental  determination.  For  example, 
the  accuracy  needed  in  the  basis  vector  approximation  can  be  determined  from  (6.12) 
when  E(p2)  has  been  determined,  and  when  an  acceptable  amount  of  requantization 
error  has  been  decided  upon. 


Applying  these  results  to  the  filtering  example,  consider  the  following  error  mea¬ 


sure: 


ESNRi  =  10log10 


E(a-) 


£((<*- a)2)  J 

It  is  not  unreasonable  to  expect  SNRi  to  behave  like  the  quantity  ESNRi  as 


(6.14) 


j= i 


9 


if2(Q}-aj)2 

J  i=i 


E(a')  as  J 


oo 


E(( a  —  a)2)  as  J  — ►  oo 


(6.15a) 

(6.15b) 


where  convergence  occurs  with  probability  1. 

Proceeding  in  a  like  manner  with  SNR2,  its  analogue  is  ESNR2, 

f  E[cr) 

ESNR2  =  ,Olog10|^riHJ. 

Thus,  the  degradation  in  the  expected,  signal  to  expected  noise  ratio  is 


(6.16) 


AESNR  =  ESNRi  -  ESNR2 


=  10  log  io 


'£((<>  ~«)2)' 
.£((«- «)2).  ' 


(6.17) 


Assuming  that  the  errors  due  to  quantization  (a  —  a)  are  uncorrelated  with  the  errors 
due  to  requantization  (a  —  a),  it  follows  that 

E((q  —  a)2)  =  E((a  —  a)2  +  (a  —  a)2  +  2(a  —  a)(a  —  a)) 

=  E((a  -  a)2  +  (a  -  a)2).  (6.18) 


Substituting  (6.18)  into  (6.17), 


AESNR  =  10log10 
=  10  log10 


E((a-d)2) 

E((a-a)  2) 


(6.19) 
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A  straightforward  substitution  yields 


AESNR  =  10logi0 


(qo  +  q?  +  v%  +  n)E(p2\) 

lE{e\) 


(6.20) 


The  felicitous  and  surprising  result  here  is  that  (6.20)  does  not  depend  on  the  summation 
length  N  or  E(al).  For  a  fixed  processing  function  consisting  of  a  sum  of  products 
(with  no  assumption  on  the  number  of  terms  in  the  products  needed),  the  AESNR, 
due  to  imprecise  requantization,  will  not  change  when  the  summation  is  lengthened, 
even  though  the  size  of  the  algebraic-integer  coefficients  increases  due  to  the  longer 
computation.  This  says  that  one  can  use  table  10  to  predict  the  performance  of  linear 
filters  of  arbitrary  length. 

Finally,  note  that  the  expression  for  AESNR  is  in  qualitative  agreement  with  the 
results  of  table  10.  The  size  of  E{p\ )  will  hardly  be  affected  by  the  range  M,  but  as  M 
is  increased,  the  quantity  E(t\)  will  decrease,  resulting,  for  a  fixed  requantization  level 
(Vo  +  +  r/2  + 173  is  constant),  in  an  increase  in  the  fraction  of  (6.20).  This  agrees  with 

the  increasing  values  of  AESNR  along  the  rows  of  table  10. 


6.3  COMPLEX  ALGEBRAIC-INTEGER  REQUANTIZATION 

Complex  algebraic-integer  requantization  involves  a  sum  of  products  where  each 
product,  in  general,  involves  am  integer  times  a  complex  number.  Rather  than  using 
a  full  complex  multiplier,  it  is  more  efficient  to  treat  the  product  as  two  real  multipli¬ 
cations.  Thus,  it  is  natural  to  treat  real  and  imaginary  parts  separately  and  reduce 
the  calculation  to  one  involving  real  algebraic  integers.  This  approach  is  illustrated  for 
2[c2in/16]  usjng  t}je  reaj  subring  Z[y/2  +  y/2).  In  the  following,  u>  =  e2x‘/16. 

An  element  x  +  yi  =  ao  +  a\u  -i-  •  •  •  -f-  aju>7  of  Z [a/]  has  real  and  imaginary  parts 
given  by 


(6.22) 

9  =  +  (6.23) 

These  formulas  can  be  derived  by  substituting  u>  =  \J2  -f  \/2  /2  +  i\J 2  — \/2  /2,  or  bv 
appealing  to  the  results  of  section  3.3,  in  particular  the  change  of  basis  matrix  (3.13). 

Formulas  (6.22)  and  (6.23)  imply  that  to  implement  the  requantization  for  Z[u;j 
(expressed  in  terms  of  the  polynomial  basis),  the  vector  of  coefficients  (ao,  ai,  ....  a-j) 
can  be  transformed  by  the  change  of  basis  matrix  given  in  (3.13),  and  the  resulting 
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vector  split  in  half  and  input  into  two  Z(v 2  +  >/2]  requantizers,  which  are  implemented 
using  the  nonpolynomial  basis.  The  entries  in  the  transformation  (3.13)  can  be  scaled 
by  2  to  eliminate  the  fractions  ±1/2,  and  the  transformation  implemented  with  three 
additions,  three  subtractions  and  two  shifts  (for  implementing  the  multiplications  by 
2). 


6.4  IMPLEMENTATION  ISSUES 

The  final  requantization  involves  forming  an  inner  product  between  the  current  vec¬ 
tor  of  algebraic-integer  coefficients  and  a  fixed  vector  of  basis  approximations.  As  such, 
the  fundamental  operation  is  a  multiply  and  accumulate.  Off-the-shelf  components  can 
be  used  to  implement  this  function  provided  the  speed,  algebraic-integer  coefficient  size, 
and  basis  approximation  accuracy  requirements  can  be  satisfied.  Currently,  commercial 
multiply-and-accumulate  components  are  available  that  perform  16-bit  x  16-bit  fixed- 
point  multiplications  with  up  to  40  bit  accumulations  with  clock  cycle  time  ranging 
from  50  to  100  ns  (up  to  20  MHz  throughput  rate). 

If  the  accuracy  requirements  of  an  application  imply  that  large  algebraic-integer 
coefficients  and  highly  accurate  basis  approximations  must  be  used  in  the  requantiza¬ 
tion,  and  if  high  throughput  is  required,  then  performing  the  requantization  using  an 
RNS  may  be  the  best  solution.  In  this  case,  the  basis  approximations  are  scaled  to 
integers  using  an  appropriately  large  scale  factor.  The  RNS  for  the  requantization  is 
formed  by  adding  moduli  to  the  the  existing  RNS  of  the  algebraic- integer  processor  to 
contain  the  increased  range.  The  residue  representations  that  result  from  the  inner-level 
reconversion  are  input  directly  to  the  second  RNS  processor.  The  inputs  corresponding 
to  the  additional  moduli  are  determined  by  a  base  extension  process,  one  for  each  co¬ 
ordinate  position.  A  final  processor  that  implements  the  Chinese  Remainder  Theorem 
(CRT)  is  required  .  The  fractional  representation  method  of  section  4  is  suggested, 
since  the  full  output  precision  is  probably  not  needed  at  this  stage. 

Is  is  interesting  to  observe  that  the  two-stage  RNS  processor  described  above  can 
be  viewed  as  a  single  RNS  process.  A  vector  of  residues  produced  by  the  inner  channels 
for  a  given  moduli  is  multiplied  by  the  reconversion  matrix.  Next,  requantization  is  per¬ 
formed  by  computing  the  inner  product  of  the  resulting  vector  and  the  vector  of  residues 
of  the  basis  approximations,  with  the  result  being  input  to  the  final  CRT  processor. 
Instead  of  computing  the  result  in  this  order,  however,  the  inner-level  conversion  and 
the  final  requantization  can  be  combined  by  premultiplying  the  reconversion  matrix  by 
the  vector  of  basis  residues,  to  produce  a  reconversion  vector.  The  output  conversion 
then  involves  taking  the  inner  product  of  this  reconversion  vector  and  the  vector  of 
residues  produced  by  the  inner  channels,  with  the  result  still  being  input  to  the  CRT 
processor. 
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The  practicality  of  the  simplified  algebraic-integer  RNS  processor  described  above 
depends  in  large  part  on  the  number  of  extra  moduli  that  are  needed  to  contain  the 
dynamic  range  of  the  requantization  result.  Additional  processing  channels  (replicated 
by  the  degree  of  the  algebraic  integer-extension)  are  required  for  each  of  the  additional 
moduli.  Since  the  processing  should  be  of  sufficient  complexity  to  amortize  the  overhead 
of  RNS  conversion  and  reconversion,  even  the  addition  of  a  single  extra  moduli  may 
negate  the  advantage  of  the  simplified  output  conversion.  In  many  processing  situations 
the  range  required  to  contain  the  requantization  result  will  be  considerably  larger, 
and  the  two-stage  RNS  implementation  will  be  more  appropriate.  If,  however,  the 
dynamic  range  after  requantization  is  comparable  to  (or  smaller  than)  the  dynamic 
range  of  the  original  RNS,  as  would  be  the  case  if  the  requantization  values  correspond 
to  very  small  positive  quantities,  then  the  simplified  output  conversion  method  would 
be  advantageous. 

6.5  CONCLUSION 

This  section  treated  the  final  requantization  from  the  algebraic- integer  number 
representation  back  to  a  conventional  binary  or  analog  representation.  An  example  of 
an  FIR  filter  (a  sum  of  products)  with  numerical  quantities  represented  by  elements 
of  Z(\/2  +  V5]  was  used  to  illustrate  the  issues  involved,  including  the  desirability  of 
scaling  the  basis  values  to  improve  their  approximations  and  the  efficient  treatment  of 
complex  requantization.  In  this  case,  the  degradation  of  the  overall  performance  due  to 
requantization  errors  was  negligible  provided- the  basis  values  were  approximated  with 
a  precision  comparable  to  the  precision  used  for  the  inputs  and  filter  coefficients.  A 
similar  conclusion  was  reached  in  [4]  for  the  FFT.  An  error  analysis  showed  that,  for  a 
general  sum-of-products  calculation,  where  the  products  are  assumed  to  be  orthogonal 
random  variables,  the  performance  degradation,  expressed  as  a  difference  of  SNRs,  is 
independent  of  the  algebraic- integer  coefficient  sizes  and  the  length  of  the  summation. 

However,  there  is  no  guarantee  that  the  heuristics  derived  for  the  FIR  filter  (prod¬ 
uct  with  two  terms)  or  the  FFT  (most  of  the  simulation  results  in  [4]  were  for  a 
256-point  radix  16  FFT — a  sum  of  3-term  products)  will  apply  to  arbitrary  processing 
situations  or  functions,  particularly  situations  or  functions  that  involve  product  terms 
significantly  larger  than  considered  in  these  past  studies.  The  error  analysis  implied 
that  the  magnitude  of  the  products  themselves  adversely  affects  the  requantization  er¬ 
ror.  These  magnitudes  depend,  at  least,  on  the  magnitude  of  the  input  data  and  on 
the  number  of  terms  involved  in  the  products.  As  one  example  of  a  more  complicated 
situation,  computing  a  single  term  of  the  adjoint  of  a  k  x  k  matrix  requires  a  sum 
consisting  of  (k  —  I)!  products  with  (k  —  1)  terms  each.  In  such  situations,  studies 
similar  to  the  ones  in  this  section  would  have  to  be  conducted  to  obtain  the  appropriate 
requantization  precision  heuristic.  The  performance  degradation  due  to  finite  preci¬ 
sion  requantization  should  be  independent  of  the  number  of  terms  in  the  sum,  so  this 
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calibration  of  the  requantization  precision  can  be  determined  using  sums  with  shorter 
length. 

Commercial  multiply-and-accumulate  chips  can  be  used  to  implement  the  sum- 
of-products  calculation  involved  in  the  requantization.  Current  devices  can  handle 
coefficient  sizes  and  basis  approximations  of  up  to  16  bits  at  a  clock  rate  of  up  to 
20  MHz.  For  applications  with  more  severe  accuracy  or  speed  requirements,  an  RNS 
implementation  of  the  requantization  is  suggested,  yielding  a  two-stage  RNS  implemen¬ 
tation.  A  simpler  RNS  implementation,  which  combines  the  inner-level  conversion  with 
the  requantization,  is  possible  if  the  increase  in  the  dynamic  range  requirements  due  to 
requantization  are  negligible. 
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SECTION  7 

FUTURE  WORK  AND  CONCLUSION 


The  objective  of  this  work  was  to  provide  solutions  to  the  quantization  and  con¬ 
version  problems  involved  in  algebraic- integer  processing,  thus  paving  the  way  for  a 
variety  of  algebraic-integer  RNS  processors.  Future  work  related  to  this  effort  could 
involve  the  implementation  of  the  designs/structures  contained  in  this  report  as  well 
as  additional  studies  of  the  quantization  properties  of  finite  sets  of  algebraic  integers 
(for  example,  the  conjecture  at  the  end  of  section  3.3  concerning  the  performance  of 
the  separate  real  and  imaginary  complex  quantization  scheme).  More  importantly,  the 
utility  of  algebraic-integer  RNS  processing  must  be  evaluated  by  comparing  the  method 
with  other  processing  schemes  in  the  case  of  specific  applications. 

This  section  outlines  future  work  involving  an  algebraic-integer  brassboard,  which 
initially  will  be  used  to  implement  an  algebraic-integer  RNS  finite-impulse-response 
(FIR)  filter.  In  part,  the  purpose  of  the  brassboard  is  to  have  a  working  algebraic- 
integer  RNS  processor  for  near-term  demonstration  purposes.  A  preliminary  simula¬ 
tion  analysis  comparing  integer  and  algebraic-integer  processing  in  the  case  of  an  FIR 
filter,  which  serves  to  indicate  the  scope  of  the  brassboard,  is  described.  A  final  section 
summarizes  the  conclusions  of  this  report,  including  some  final  thoughts  on  some  ap¬ 
plications  that  are  likely  to  benefit  the  most  from  using  the  algebraic-integer  number 
representation. 


7.1  ALGEBRAIC-INTEGER  BRASSBOARD 

An  algebraic-integer  brassboard  for  implementing  and  testing  the  quantization  and 
conversion  functions  described  in  this  report  is  being  planned.  In  addition  to  demon¬ 
strating  that  the  conversion  functions,  when  implemented  successively,  yield  the  identity 
function,  the  brassboard  will  be  used  to  configure  an  algebraic-integer  transversal  fil¬ 
ter.  Surplus  RNS  transversal  filter  chips,  which  were  fabricated  under  a  separate  effort 
for  an  experimental  wideband  high-frequency  communication  system,  will  be  used  to 
implement  the  processing  function.  The  performance  of  the  resulting  algebraic-integer 
transversal  filter  will  be  evaluated. 

Two  different,  but  related,  demonstrations  are  planned.  First,  a  real  algebraic- 
integer  processor  that  uses  the  4th  degree  extension  Z[\/2+\/2],  referred  to  as  the 
real  demonstration ,  will  be  built.  Two  of  the  primes  for  which  transversal  filter  chips 
are  available,  17  and  31,  are  suitable  for  this  ring.  The  second  processor,  referred  to  as 
the  complex  demonstration ,  will  use  the  8th  degree  cyclotomic  extension  Zje2*’/16].  In 
this  case,  only  the  chips  for  the  prime  17  can  be  used.  Appendix  B  lists  the  inner-level 
conversion  matrices  for  these  rings  and  primes. 
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Since  Z[\/2  +  \/2]  corresponds  to  the  real  numbers  in  Z[e2T*^16],  the  complex 
demonstration  can  take  advantage  of  the  hardware  developed  for  the  real  demonstration. 
The  two-stage  quantization  approach  of  section  3.2  will  be  used  in  the  real  demonstra¬ 
tion,  and  the  approach  in  section  3.3  of  treating  real  and  imaginary  parts  separately  will 
be  used  to  reduce  the  complexity  of  the  complex  quantization  problem.  Similarly,  the 
hardware  developed  for  requantization  in  the  real  demonstration  can  be  replicated  and 
used,  as  in  section  6.2,  to  perform  the  requantization  in  the  complex  demonstration. 

In  the  two-stage  quantization  approach  of  section  3.3,  tables  are  required  to  store 
the  algebraic- integer  approximations.  These  approximations  can  be  converted  ahead  of 
time  into  the  inner  level  of  RNS  parallelism  and  be  stored  in  the  table  in  this  form, 
eliminating  the  need  to  explicitly  implement  the  conversion  into  the  two  levels  of  RNS 
parallelism.  Of  course,  this  requires  a  table  for  each  prime  in  the  RNS.  The  conversion 
out  of  the  inner  level  of  parallelism  will  be  implemented  with  the  chip  described  in 
section  5.  The  real  demonstration  will  require  a  2  x  2  grid  of  these  chips  for  each 
modulus;  the  complex  demonstration  will  require  a  4  x  4  grid. 

The  conversion  in  the  brassboard  out  of  the  outer  level  of  RNS  parallelism,  which 
involves  integer  RNS,  will  either  be  performed  using  a  table-based  converter  (in  the  case 
of  the  real  demonstration  with  the  moduli  17  and  31)  or  not  be  required  at  all  (in  the 
case  of  the  complex  demonstration  with  only  the  modulus  17).  This  approach  is  consis¬ 
tent  with  the  fact  that  the  reduced  ranges  involved  in  algebraic-integer  RNS  decrease 
the  complexity  of  the  integer-RNS  outer  level  reconversion,  making  the  feasibility  of 
table-based  converters  more  likely.  The  final  requantization  will  be  implemented  using 
commercial  multiply-and-accumulate  chips. 


7.2  PERFORMANCE  OF  AN  ALGEBRAIC-INTEGER  FIR  FILTER 

This  section  reports  on  the  results  of  a  simulation  activity  that  was  undertaken  to 
determine  if  the  proposed  algebraic-integer-RNS  FIR  filter  planned  for  the  brassboard 
could  provide  a  meaningful  demonstration.  The  proposed  implementation  will  be  lim¬ 
ited  to  the  primes  17  and  31  with  a  dynamic  range  (per  coefficient)  of  17  x  31  =  527;  i.e. 
the  absolute  value  of  the  coefficients  at  the  output  must  be  less  than  or  equal  to  263. 
Three  different  filters  were  considered  in  this  preliminary  study.  The  actual  filters  to  be 
implemented  will  be  subjected  to  a  similar,  but  more  exhaustive,  simulation  study.  The 
present  effort  is  an  initial  attempt  at  comparing  integer  and  algebraic- integer  processing. 

A  simulation  of  an  FIR  filter  was  designed  in  order  to  (1)  evaluate  the  error  vs. 
range  performance  of  the  algebraic  integers  Z[B]m>  for 


B  =  {1,^2  +  >/2,  \/2,  s/'l  —  y/2  } , 
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and  (2)  compare  their  performance  to  that  of  the  integers.  The  simulation  consisted  of 
three  fundamental  blocks:  the  initial  quantization  of  the  input  data  (to  either  integers 
or  algebraic  integers);  the  filtering  operation,  where  the  quantized  data  was  convolved 
with  the  quantized  filter  coefficients;  and  the  final  requantization  for  the  algebraic- 
integer  case.  In  this  application,  finite-precision  requantization  can  be  performed  with 
essentially  no  loss  in  performance  without  having  to  resort  to  a  second  RNS  processor. 
Thus,  in  this  preliminary  study,  the  requantization  is  performed  using  floating-point 
numbers,  corresponding  to  it  being  performed  by  commercial  multiply-and-accumulate 
chips  of  more  than  adequate  accuracy.  The  primary  purpose  of  this  study  is  to  deter¬ 
mine  the  range  requirements  of  the  respective  RNS  processors  for  equivalent  levels  of 
performance. 

The  first  task  in  setting  up  a  simulation  of  this  type  is  the  choice  and  design  of  a  set 
of  filters  whose  coefficients  are  drawn  from  either  L\f ,  indicating  the  integers  modulo  A/, 
or  Z[B]a/.  Unfortunately,  finite-wordlength  filter  design  is  an  open  problem  at  present. 
More  specifically,  for  most  filter  design  criteria  a  deterministic  way  of  finding  the  best 
coefficients  drawn  from  the  available  set  is  not  known,  short  of  exhaustive  search.  This 
is  mainly  due  to  the  difficulty  of  fitting  the  discrete,  finite-wordlength  constraints  into 
the  analytic  methods  employed  in  filter  design. 

The  current  practice  in  finite-wordlength  filter  design  is  to  design  an  infinite- 
wordlength  filter,  then  approximate  it  in  some  manner  with  finite-wordlength  repre¬ 
sentatives.  The  unexpected  problems  that  occur  in  this  process  can  be  seen  in  the 
following  example.  A  lowpass  filter  was  designed  and  approximated  by  two  finite- 
wordlength  sets — Z[B]io  and  Z[B]i2-  The  approximations  were  chosen  so  as  to  mini¬ 
mize  the  squared  error  of  the  finite-wordlength  filter  coefficients.  Naturally,  the  filter 
designed  using  M  =  12  had  a  lower  squared  error  than  that  using  M  =  10.  However, 
on  a  set  of  test  input  data  consisting  of  two  sinusoids,  one  in  the  passband  and  one 
in  the  stopband,  plus  some  additive  white  Gaussian  noise  (AWGN),  the  M  =  10  filter 
performed  better  than  the  M  =  12  filter  even  though  the  input  data  were  quantized  less 
accurately  in  the  M  =  10  case.  This  occurred,  even  though  its  total  squared  error  was 
higher,  because  the  M  =  10  filter  preserved  a  frequency  notch  better  than  the  other 
filter  and  one  of  the  sinusoids  happened  to  fall  close  to  that  notch.  It  is  important  to 
remember  that  squared  error  is  not  always  the  best  approximation  performance  mea¬ 
sure,  and  that  a  filter  approximation  that  is  good  in  one  particular  situation  may  not 
be  suitable  in  another. 

For  the  simulation  under  discussion,  the  filters  were  designed  by  first  designing 
an  infinite-wordlength  filter  of  the  desired  type  and  then  approximating  the  infinite- 
wordlength  coefficients  with  the  closest  scaled  representative  in  Z^  or  Z[B]a/.  There 
were  three  infinite-wordlength  filters  designed — a  55-tap  filter  with  three  stopbands 
and  two  passbands,  a  60-tap  lowpass  filter,  and  a  61-tap  highpass  filter.  These  fil- 
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ters  were  approximated  by  Z[B]a/  for  M  =  2,  4,  6,  8,  10,  12,  and  by  1m  for  M  = 

96  ry7  98  99  9IO  9II  9I2 

i  —  9- 

The  input  data  fed  into  the  filters  consisted  of  a  combination  of  sinusoids  with 
AWGN.  There  were  ten  such  data  sets  computed,  each  150  samples  long  and  consisting 
of  a  combination  of  one  or  more  of  the  frequencies  0.025,  0.125,  0.2,  0.33,  and  0.45. 
These  frequencies  were  chosen  because  none  of  them  lay  in  the  don’t  care  bands  of  any 
of  the  three  basic  types  of  filters.  The  55-tap  multiband  filter  was  run  with  all  ten  sets 
of  data,  while  the  other  two  were  run  on  only  two  of  the  sets  (the  same  two  in  each 
case). 

Each  run  consisted  of  a  particular  set  of  input  data  quantized  either  by  1m  or 
1\JA\m  (using  the  two-stage  conversion  method  of  section  3.2).  These  inputs  were  then 
convolved  with  a  filter  approximated  by  the  same  set.  Along  with  integer  and  algebraic- 
integer  processing,  a  baseline  wa s  established  for  each  input-data/filter  combination  by 
filtering  the  floating-point  input  data  directly  with  the  original  infinite- wordlength  filter. 
The  outputs  of  this  process  were  used  as  the  standard  against  which  errors  occurring 
in  the  integer  or  algebraic-integer  processing  could  be  measured. 

The  error  was  measured  in  terms  of  signal-to-noise  ratio  (SNR)  in  the  same  manner 
as  in  equation  (6.6),  i.e., 

2 

SNR=10log,„  ,  (7.1) 

l*,m  -<■>) 

where  Oj  is  the  jth  output  of  the  floating-point  run,  and  a}  is  the  jth  output  of  the  in¬ 
teger  or  algebraic-integer  process.  The  other  statistic  one  is  interested  in  is  the  dynamic 
range  required  to  contain  the  results.  Ideally,  one  would  like  to  minimize  the  dynamic 
range  while  maximizing  the  SNR.  The  trade-off  between  these  two  statistics  is  given  in 
tables  11  and  12  for  one  particular  set  of  input  data  run  on  all  the  filter  approximations. 
Table  11  is  for  algebraic-integer  processing,  while  table  12  is  for  integers.  The  input 
data  set  consisted  of  a  unity-amplitude  sinusoid  with  frequency  0.2,  a  unity-amplitude 
sinusoid  with  frequency  0.33,  and  AWGN  with  power  0.01.  The  algebraic-integer  range 
shown  was  the  absolute  value  of  the  largest  coefficient  output  that  occurred,  while  the 
integer  range  was  the  absolute  value  of  the  largest  integer  output  that  occurred. 

The  results  in  tables  11  and  12  should  only  be  compared  for  M  =  2,  4,  6,  8,  since 
for  larger  values  of  M  the  front-end  12-bit  A/D  converter  in  the  two-stage  quantization 
method  becomes  a  limiting  factor  on  the  accuracy  of  the  algebraic-integer  representa¬ 
tions  of  the  input  data  (see  section  3.4).  Although  the  fact  that  the  filter  coefficients 
are  being  approximated  directly  does  result  in  some  performance  gain  over  the  12-bit 
integer  case  for  M  =  10,  12. 

Consider  M  =  4  for  example.  The  performance  of  filter  #1,  48.15  dB,  corresponds 
to  a  performance  for  the  integer  filter  of  between  8  and  9  bits;  filter  #2  is  slightly  below 
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Table  11.  Range  vs.  SNR  for  Algebraic  Integers 


Filter  #1 

Filter  #2 

Filter  #3 

M 

Range 

SNR 

Range 

SNR 

Range 

SNR 

2 

42 

29.86 

30 

22.82 

53 

27.93 

4 

147 

48.15 

113 

41.50 

106 

43.60 

6 

187 

52.80 

221 

54.42 

236 

54.46 

8 

386 

63.68 

498 

63.57 

383 

59.80 

10 

526 

67.90 

672 

66.16 

583 

66.97 

12 

799 

69.53 

853 

68.94 

830 

69.55 

Table  12.  Range  vs.  SNR  for  Integers 


Filter  #1 

Filter  #2 

Filter  #3 

bits 

Range 

SNR 

Range 

SNR 

Range 

SNR 

6 

1172 

22.30 

986 

28.29 

1046 

22.49 

7 

5115 

32.21 

3937 

38.65 

3867 

36.45 

8 

20144 

44.34 

15871 

42.96 

15642 

40.62 

9 

81343 

50.18 

64135 

47.32 

62863 

50.89 

10 

324876 

53.59 

255207 

49.86 

251371 

54.83 

11 

1301579 

61.26 

1025157 

59.58 

1006421 

60.39 

12 

5103488 

63.43 

4221821 

62.21 

3917674 

61.29 

8-bit  performance;  and  filter  #3  is  between  8  and  9  bits.  These  levels  are  consistent 
with  the  accuracy  of  8.53  bits  for  the  quantizer  Z[B]<  against  a  uniform  input  (table  4 
of  section  3.4),  although  the  present  input  is  most  certainly  not  uniformly  distributed. 

The  equivalent  integer  filters  for  M  —  2,  6,  and  8  are  also  consistent  with  the 
accuracies  of  the  respective  algebraic-integer  quantizers  Z[B]Af.  Some  anomalies  are  to 
be  expected,  due  to  the  problem,  mentioned  earlier,  with  using  SNR  as  a  performance 
measure.  In  summary,  the  results  in  tables  11  and  12  do  not  present  any  surprises,  and 
it  will  be  assumed  that  the  integer  equivalent  of  an  algebraic-integer  FIR  filter  can  be 
obtained  by  a  bit  level  that  approximates  the  inputs  and  filter  coefficients  to  the  same 
level  of  accuracy. 

An  equally  important  point  of  comparison  is  the  dynamic  ranges  required  by  the 
two  processing  methods.  To  get  an  idea  of  the  largest  range  one  might  be  called  upon 


to  handle,  the  absolute  values  of  the  largest  range  encountered  over  all  the  runs  using 
the  55-tap  filter  were  determined  for  each  M.  The  other  two  filters  were  not  included 
so  that  filter  length  would  be  a  constant  factor  in  the  measured  ranges.  The  results 
for  integers  and  algebraic  integers  are  shown  below  in  table  13.  It  should  be  mentioned 
that  the  dynamic  range  for  the  algebraic  integer  M  =  12  case  (1312)  was  an  extreme 
outlier — the  next  largest  output  coefficient  encountered  in  that  or  any  other  run  was 
974. 


Table  13.  Summary  of  Largest  R  anges 


Algebraic 

Integers 

Integers 

M 

Range 

bits 

Range 

2 

48 

6 

2008 

4 

164 

7 

8532 

6 

239 

8 

34896 

8 

524 

9 

138586 

10 

705 

10 

559369 

12 

1312 

11 

2234533 

— 

12 

8946667 

The  ranges  in  table  13  suggest  that  the  brassboard,  with  the  primes  17  and  31 
corresponding  to  an  absolute  value  range  of  263,  could  implement  the  55-tap  filter  for 
M  —  6.  A  corresponding  integer  filter  involves  approximations  of  between  10  and  11.42 
bits  (for  uniform  and  Gaussian  inputs;  see  tables  4  and  8)  and  requires  a  range  of  at 
least  559, 369.  If  5-bit  (or  less)  primes  are  to  be  used,  then  the  integer  processor  requires 
at  least  5  moduli  in  the  RNS  ((31  x  29  x  23  x  19  -  l)/2  =  196,431;  (31  x  29  x  23  x 
19  x  17—  l)/2  =  3, 339, 335).  For  Gaussian  inputs,  this  RNS  may  not  even  suffice  since 
the  range  would  be  closer  to  5,000,000. 

It  should  be  noted  that  the  algebraic-integer  system,  although  only  requiring  2 
moduli  in  the  RNS,  has  4  processing  channels  per  moduli  for  a  total  of  8.  This  in¬ 
efficiency,  8  channels  versus  5  or  6,  may  be  due  to  the  fact  that  each  coefficient  of 
the  algebraic-integer  number  representation  experiences  its  own  additive  growth,  which 
in  this  example  is  between  5  and  6  bits.  This  effect  would  worsen  for  longer  filters. 
However,  a  5-  or  6-moduli  RNS  output  converter  should  be  much  more  complex  than 
a  2-moduli  RNS  output  converter  replicated  4  times  (for  each  of  the  algebraic-integer 
coefficients). 
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Finally,  only  the  nonpolynomial  basis  B  was  used  in  the  simulations  because  the 
polynomial  basis  resulted  in  larger  ranges.  For  example,  a  run  with  the  polynomial  basis 
that  yielded  a  range  of  72  for  M  =  2  compared  to  a  range  of  48  for  the  nonpolynomial 
basis  under  the  same  conditions. 

7.3  CONCLUSION 

This  paper  addressed  the  quantization  and  conversion  problems  involved  in 
algebraic-integer  RNS  processing.  The  algebraic-integer  number  representation  was  in¬ 
troduced  using  four  important  examples:  cyclotomic  extensions  involving  8th  and  16th 
roots  of  unity,  and  their  respective  real  subrings.  A  primary  concern  when  choosing 
a  basis  for  the  algebraic-integer  representation  is  to  minimize  the  growth  in  coefficient 
size  due  to  multiplication.  This  growth  depends  on  the  size  of  the  entries  in  the  co¬ 
efficient  forms,  which  are  matrices  that  describe  the  relationships  that  exist  between 
the  basis  elements.  These  relationships  depend,  in  part,  on  the  minimum  polynomial 
involved  in  the  algebraic-integer  extension.  In  the  4th  degree  real  example  considered, 
this  meant  processing  with  a  nonpolynomial  basis.  The  complexity  of  input  and  output 
phases  of  a  complex  implementation  (I  and  Q  channels)  can  be  reduced  by  representing 
the  complex  algebraic  integers  in  terms  of  a  product  basis,  although  it  is  best,  from  a 
dynamic  range  viewpoint,  to  process  using  the  complex  polynomial  basis. 

Algebraic-integer  RNS  was  introduced  as  a  generalization  of  QRNS  for  the  Gaus¬ 
sian  integers.  In  algebraic-integer  RNS,  each  parallel  channel  of  integer  RNS  (the  outer 
level  of  parallelism)  is  split  further  into  another  level  of  parallel  channels  (the  inner 
level  of  parallelism).  This  further  splitting,  for  a  modulus  m,  makes  the  complicated 
algebraic-integer  product  easy  to  compute  in  parallel,  but  is  only  possible  when  the 
minimum  polynomial  of  the  algebraic-integer  extension,  considered  modulo  m.  factors 
completely  into  distinct  linear  terms. 

Unlike  the  Gaussian  integers,  which  form  the  quantizer  used  in  QRNS  and  cor¬ 
respond  to  a  two-dimensional  integer  lattice,  the  algebraic  integers  form  a  dense  rep¬ 
resentation  and  are  a  rich  source  of  quantization  problems.  In  practice,  algebraic- 
integer  quantizers  are  derived  by  bounding  the  size  of  the  algebraic-integer  coefficients. 
Two  strategies  for  direct  analog-to-(real)algebraic-integer  conversion  were  described, 
one  based  on  the  compressor  characteristic  of  the  nonuniform  algebraic- integer  quan¬ 
tizer,  and  the  other  based  on  a  generalization  of  a  successive-approximation  converter. 
A  suboptimal  and  straightforward  two-stage  method  of  real  quantization  was  also  pro¬ 
posed. 

In  the  important  case  that  was  considered,  the  suboptimal  method  performed 
equivalently  to  the  direct  method  as  long  as  the  conventional  A/D  converter,  which 
forms  the  front  end  of  the  two-stage  method,  has  one  bit,  more  conservatively  two  bits, 
of  precision  beyond  the  precision  of  the  direct  algebraic-integer  quantizer.  Thus,  further 
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hardware  development  of  a  direct  analog-to-algebraic-integer  converter  is  only  required 
for  applications  requiring  processing  at  the  speed  and  accuracy  limits  of  current  A/D 
technology.  By  treating  real  and  imaginary  parts  separately,  complex  quantization  was 
reduced  to  the  real  case,  with  a  performance  loss  conjectured  to  be  just  a  doubling  in 
the  range  of  the  coefficients. 

The  algebraic-integer  quantizers  yield  a  distributed  representation — a  single  large 
representative  is  replaced  by  a  vector  of  smaller  representatives.  For  the  same  level 
of  accuracy,  the  total  cost  (measured  by  the  number  of  points)  of  the  conventional 
and  algebraic- integer  quantizers  are  comparable,  especially  when  issues  of  robustness 
are  considered.  Thus,  in  the  case  considered — a  degree  4  real  extension — for  the  same 
level  of  accuracy,  a  single  representative  with  a  range  R  is  replaced  by  a  vector  of  4 
coefficients,  each  with  a  range  of  about  \/~R.  This  dramatic  reduction  in  range  at  the 
input  implies  a  smaller  output  range,  hence  a  smaller  RNS  can  be  used,  although  the 
advantage  is  somewhat  mitigated  by  the  additional  coefficient  size  growth  due  to  the 
algebraic-integer  multiplication.  The  smaller  RNS  range  reduces  the  complexity  of  the 
outer  level  integer-RNS  conversion. 

Because  the  algebraic-integer  coefficients  at  the  beginning  of  the  calculation  are 
likely  to  be  small,  the  conversion  into  the  outer  channels  will  usually  be  straightfor¬ 
ward.  Conversion  out  of  these  outer  channels  will  require  integer-RNS  output  conver¬ 
sion.  The  fractional  representation  method  performs  this  output  conversion  without  the 
large  modulo  reduction  required  by  the  Chinese  Remainder  Theorem.  The  method  was 
modified  slightly  to  get  rid  of  all  the  modular  operations  at  the  cost  of  needing  slightly 
higher  precision  approximations  of  the  fractional  quantities  involved  in  the  method. 
As  a  result,  an  implementation  with  conventional  arithmetic  is  now  possible,  making 
programming  to  various  RNSs  easy.  A  comparison  with  mixed-radix  conversion  was 
made  for  a  particular  RNS,  and  it  was  shown  that  the  active  VLSI  area  of  a  fractional 
representation  converter  was  always  less  than  that  of  an  equivalent  mixed-radix  one. 
The  VLSI  implementation  of  this  converter  is  being  developed  under  a  separate  effort. 

The  conversion  into  and  out  of  the  inner  level  of  parallel  channels,  for  a  modulus  m, 
is  a  modulo  m  vector-matrix  multiplication.  This  involves  a  sum  of  products  modulo  m, 
and  so  is  similar  to  the  functions  most  likely  to  be  implemented  by  the  RNS  processing 
channels;  i.e.,  the  complexities  are  similar  also.  An  architecture  based  on  the  LU 
decomposition  of  the  transform  matrix  was  developed  that  performs  this  product  in 
parallel.  The  LU  architecture  consists  of  a  grid  of  basic  cells  and  is  semisystolic — the 
cells  in  each  row  must  communicate.  A  chip  consisting  of  a  2  x  2  grid  of  basic  cells 
was  designed  in  scalable  CMOS.  Each  basic  cell  performs  a  multiply  and  accumulate 
modulo  a  programmable  modulus.  The  modulus,  and  hence  all  values  on  the  chip,  are 
limited  to  8  bits.  The  design  can  be  clocked  to  3  MHz  and  is  currently  undergoing 
testing  before  fabrication  (in  3  //m  technology).  A  degree  n  conversion  (either  into  or 
out  of  the  inner  level)  will  require  an  n/2  x  n/2  grid  of  these  chips  for  each  modulus. 
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After  RNS  processing  is  completed,  including  the  conversions  out  of  the  two  levels 
of  parallelism,  the  answer  is  obtained  in  algebraic- integer  form.  To  obtain  a  binary  or 
analog  result,  this  algebraic  integer  must  be  evaluated  using  finite-precision  approxima¬ 
tions  of  the  basis  elements.  Rather  than  straightforwardly  rounding  the  basis  elements 
to  some  prescribed  number  of  bits,  scaling  and  rounding  should  be  employed  to  mini¬ 
mize  the  average  squared  error  of  these  approximations.  Since  the  output  coefficients 
are  usually  quite  large,  there  is  some  concern  that  the  basis  elements  may  have  to  be  ap¬ 
proximated  with  equally  extreme  precision.  In  the  applications  considered  to  date,  this 
has  not  been  the  case — basis  approximations  with  a  precision  matching  the  precision 
of  the  inputs  are  all  that  have  been  required  for  there  to  be  no  significant  performance 
degradation  due  to  requantization.  There  is  no  guarantee  that  this  heuristic  will  apply 
in  general. 

An  error  analysis  showed  that  for  a  processing  function  consisting  of  a  sum  of 
products,  where  the  products  were  assumed  to  be  orthogonal  random  variables,  the 
performance  degradation  due  to  requantization,  measured  in  terms  of  the  decrease  in 
output  SNR,  is  independent  of  the  output  algebraic-integer  coefficient  size  and  the 
length  of  the  summation.  However,  the  magnitudes  of  the  products  do  adversely  affect 
the  requantization  error,  and  so  a  significantly  different  processing  situation  has  to  be 
treated  on  an  individual  basis. 

The  requantization  can  be  performed  with  commercial  multiply-and-accumulate 
units  or  even  a  separate  RNS  processor  if  accuracy  and/or  throughput  requirements 
dictate.  In  the  latter  case,  the  algebraic-integer  output  conversion  can  be  simplified 
in  the  special  case  that  the  dynamic  range  growth  due  to  requantization  is  negligible. 
This  would  be  the  case  if  the  final  results  correspond  to  very  small  positive  numbers. 

In  the  future,  an  algebraic-integer  brassboard  is  planned  to  test  the  quantizers 
and  converters  that  have  been  described  in  this  report.  Current  plans  also  include 
implementing  an  algebraic-integer  RNS  FIR  filter  in  the  brassboard.  A  processor  based 
on  a  4th  degree  real  extension  will  utilize  existing  processor  hardware  for  the  primes 
17  and  31.  A  related  processor  for  I  and  Q  processing  will  use  an  8th  degree  complex 
extension  and  the  prime  17.  A  preliminary  simulation  study  showed  that,  for  a  55- 
tap  filter,  to  obtain  performance  equivalent  to  the  real  algebraic- integer  RNS  processor 
with  moduli  {17,  31},  an  integer  RNS  processor  would  require  the  5-bit  prime  moduli 
{17,  19,  23,  29,  31}.  A  sixth  modulus  may  be  required  depending  on  the  distribution 
of  the  inputs  and  filter  coefficients. 

The  algebraic-integer  FIR  filter  implementation  is  not  intended  to  address  the 
fundamental  question  concerning  the  ultimate  utility  of  the  algebraic-integer  number 
representation.  Rather,  it  is  a  near-term  demonstration  tool,  which  can  be  completed 
using  existing  processing  chips,  to  be  used  to  compare  the  integers  and  algebraic  integers 
in  a  situation  in  which  we  have  considerable  prior  integer-RNS  experience. 
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What  application  should  be  used  to  demonstrate  the  advantages  of  the  algebraic- 
integer  number  representation?  It  is  expected  that  the  application  will  have  more  than 
one  of  the  following  characteristics:  (1)  sum(s)  of  products  function  (products  not  nec¬ 
essarily  limited  to  two  terms),  (2)  high  sensitivity  to  quantization  and  round-off  errors, 
(3)  matching  nonuniform  input  distributions,  (4)  high  dynamic  range  requirement,  and 
(5)  high  throughput  (and  low  latency)  requirement.  There  are  relationships  between 
these  characteristics.  For  example,  any  modest  increase  in  quantization  performance 
due  to  (3)  will  translate  into  considerable  output  performance  improvement  because  of 
(2).  Also,  difficult  to  quantify  characteristics  such  as  ease  of  design  or  the  need  for  fault 
tolerance  may  enter  the  decision. 

Most  importantly,  though,  throughput  and  latency  requirements  probably  will  im¬ 
ply  that  some  type  of  parallel  processing  is  required.  Algebraic  integers,  as  a  parallel 
processing  scheme,  are  not  likely  to  be  preferred  on  area  considerations  alone.  There 
is  an  inherent  inefficiency  with  the  algebraic-integer  number  representation  (for  that 
matter  with  any  distributed  number  representation):  additive  growth  occurs  in  each 
coefficient.  An  application  is  needed  with  speed  and  accuracy  requirements  that  force 
integer  processing,  and  dynamic  ranges  so  large  that  integer  RNS  is  impractical.  Then 
the  dynamic  range  relief  offered  by  the  distributed  algebraic-integer  representation  may 
be  essential  in  obtaining  a  practical  implementation.  High-performance  adaptive  re¬ 
cursive  filtering  applications  or  various  types  of  matrix  processing,  where  the  matrices 
involved  can  have  high  condition  numbers,  could  be  the  source  of  an  appropriate  appli¬ 
cation. 
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APPENDIX  A 

LU  DECOMPOSITION 


The  purpose  of  the  section  is  to  present  some  results  about  the  LU  decomposition 
of  a  matrix  that  are  needed  in  this  paper.  Let  A  be  a  nonsingular  n  x  n  matrix  over 
some  field.  Then  A  is  L  [/-decomposable  if  there  exists  a  lower- triangular  matrix  L  and 
an  upper-triangular  matrix  U  such  that  A  =  LU.  Necessarily,  the  diagonals  of  L  and  U 
are  nonzero;  no  other  restriction  is  made  on  the  diagonals.  The  requirement  that  A  be 
nonsingular  is  not  necessary  for  some  of  the  results  that  follow;  this  requirement  is 
added  to  ease  the  exposition. 

An  LU  decomposition  for  a  matrix  is  desirable  for  the  added  information  that  it 
furnishes  and  the  simplification  that  it  can  give  to  calculations.  For  example,  the  LU 
architecture  in  section  5.2  uses  the  LU  decomposition  of  a  matrix  to  perform  matrix 
multiplication.  For  a  reference  on  LU  decomposition,  see,  for  example,  [20]  . 

Almost  all  of  the  results  in  this  section  are  well-known  (see  [20]  );  the  exception 
is  Theorem  A. 2,  for  which  no  reference  has  been  found  as  yet.  The  characterization  of 
when  a  matrix  is  LU-decomposable  is  given  in  the  following  theorem.  The  leading  i  x  i 
principal  minor  of  a  matrix  M  is  denoted  by  Af,. 


THEOREM  A.l.  A  nonsingular  n  x  n  matrix  A  is  LU-decomposable  iff  the  matrices 
Ai,  A?, . . .  ,  A„_i  are  all  nonsingular. 


PROOF:  Suppose  that  A  is  LU-decomposable  and  let  A  =  LU  be  the  resulting  decom¬ 
position.  For  i  =  1,2,...  , n  —  1,  block  arithmetic  yields  A,  =  LiUx.  Since  L,  and  U, 
are  clearly  nonsingular,  A{  must  also  be  nonsingular. 

Conversely,  let  Aj,  A2, . . .  ,  A„_i  be  nonsingular.  By  induction,  An_i  is  LU- 
decomposable.  Let  An_i  =  L'U'  be  the  decomposition.  Then  A  can  be  factored 
as 


(A.l) 


where  6n  1 , . . .  ,6nn,6in,...  are  determined  from  L\  U\  and  the  atJ,  and  blanks 

denote  Os.  Notice  that  in  the  last  product,  the  first  matrix  is  lower-triangular  and  the 
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third  is  upper-triangular.  Since  the  matrix  product 


( 1 

\ 

(  1 

bln\ 

y - 

1 

O- 

9 

i—* 

1 

—  bn,n— 1  1  / 

\  bn  1 

1 

^nn  / 

is  upper-triangular,  the  middle  matrix  in  (A.l)  is  LU-decomposable.  Therefore,  A  is 
LU-decomposable.  | 

Implicit  in  the  above  proof  is  one  method  for  calculating  an  LU  decomposition: 
given  A,  one  performs  Gaussian  elimination  on  the  rows  and  columns  to  reduce  A  to 
an  identity  matrix,  keeping  track  of  the  elementary  row  and  column  operations  used. 
These  operations  can  be  expressed  as  a  lower-triangular  matrix  and  an  upper-triangular 
matrix.  The  inverses  of  these  matrices  give  the  LU  decomposition. 

As  an  example,  consider  a  Vandermonde  matrix  V  (see  section  5.1).  Since  a 
principal  minor  of  a  Vandermonde  matrix  is  itself  a  Vandermonde  matrix,  Theorems  2.1 
and  6.1  and  induction  imply  that  a  Vandermonde  matrix  is  LU-decomposable.  This 
was  shown  by  constructive  methods  in  section  5.1. 

The  following  theorem  shows  that  any  nonsingular  n  x  n  matrix  can  be  made 
LU-decomposable  by  multiplication  by  an  appropriate  permutation  matrix. 

THEOREM  A. 2.  Let  A  be  a  nonsingular  nxn  matrix.  Then  there  exists  a  permutation 
matrix  P  such  that  PA  is  LU-decomposable. 

PROOF:  The  proof  is  by  induction.  For  n  =  2,  let 

A  =  (“»  a”)  . 

\a  21  022/ 

Then  either  an  or  a 21  is  nonzero,  since  A  is  nonsingular.  Permute  the  rows,  if  necessary, 
to  put  a  nonzero  element  in  the  ull”  spot.  The  resulting  matrix  is  LU-decomposable 
by  Theorem  A.l. 

Let  A  be  a  nonsingular  nxn  matrix  and  suppose  that  the  result  is  known  for  all 
nonsingular  (n  —  1)  x  (n  —  1)  matrices.  Form  all  (n  —  1)  x  (n  —  1)  minors  of  A  with 
columns  from  the  first  n  —  1  columns.  Then  one  of  these  minors  is  nonsingular,  otherwise 
calculating  the  determinant  of  A  by  expanding  along  the  last  column  would  imply  that 
A  is  singular.  Hence,  the  rows  of  A  can  be  permuted  so  that  the  (n  —  1)  x  (n  —  1) 
principal  minor  An_i  is  nonsingular.  That  is,  for  some  permutation  matrix  P, 

PA  =  ,  A„_i  nonsingular. 
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By  the  induction  hypothesis,  there  is  an  (n  —  1)  x  (n  —  1)  permutation  matrix  P'  such 
that  P'An-i  is  LU-decomposable.  In  particular,  all  of  the  principal  minors  of  P' An- 1 
are  nonsingular.  Then  so  are  all  of  the  principal  minors  of 


Applying  Theorem  A.l  finishes  the  result.  | 

Suppose  that  a  nonsingular  matrix  A  is  LU-decomposable  with  decomposition 
A  =  LU .  If  D  is  any  nonsingular  diagonal  matrix,  then 

LU  =  LDD~lU  =  ( LD){D~lU ). 

Hence,  there  is  another  LU  decomposition  of  A  given  by  LD  and  D~lU.  The  next 
theorem  asserts  that  all  LU  decompositions  of  A  are  of  this  form. 

THEOREM  A. 3.  Let  A  be  a  nonsingular  LU-decomposable  matrix  with  decompositions 
LU  and  L'U' .  Then,  for  some  nonsingular  diagonal  matrix  D, 

L  =  L'D  and  U  =  D~lU' . 

PROOF:  Since  A  =  LU  =  L'U'  and  everything  is  invertible,  (L')~l  L  =  U'U~l.  The 
left  side  is  a  lower  triangular  matrix  and  the  right  side  is  an  upper-triangular  matrix, 
so  the  expression  is  some  (nonsingular)  diagonal  matrix.  | 

Theorem  A. 3  says  that  by  forcing  L  and  U  to  have  some  prescribed  diagonals,  say 
all  Is,  then  A  can  be  written  uniquely  as  A  =  LDU,  for  some  diagonal  matrix  D.  For 
the  purposes  of  this  paper,  it  is  advantageous  to  not  so  restrict  L  and  U  and  to  in  fact 
allow  the  diagonals  to  be  determined  as  the  context  requires. 
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APPENDIX  B 


SOME  ALGEBRAIC  INTEGERS 


In  this  appendix,  the  matrices  and  other  information  needed  to  perform  the  inner- 
level  conversion  are  given  for  those  rings  of  algebraic  integers  and  primes  that  are  likely 
to  be  used  in  some  near-term  implementations.  Each  ring  of  algebraic  integers  is  of  the 
form  Z [/?]  modulo  some  prime  p.  Algebraic  integers  represented  with  the  standard  basis 
are  of  the  form 


ag  -f-  a\/3  +  *■•  +  Qd— \(3*  1 1 


(B.l) 


where  the  a,  are  modulo  p,  and  d  is  the  degree  of  the  extension.  The  minimum  poly¬ 
nomial  of  0  and  its  roots  modulo  p  are  also  given;  the  order  of  the  roots  is  consistent 
with  the  matrices. 

If  more  than  one  basis  is  considered,  each  is  represented  as  {/ 3q , . . .  ,  0d-\},  so  that 
the  algebraic  integers  are  of  the  form 


aoflo  +  ai/?i  +  ■••  +  ad- 1 /3d- 1  ■ 


The  change-of-basis  matrices  to  and  from  the  standard  representation  of  (B.l)  is  given. 
The  entries  of  the  change  of  basis  matrices  are  rational  numbers  and  are  in  fact  integers 
whenever  the  two  bases  are  both  integral  bases.  The  other  matrices  are  consistent  with 
the  order  of  the  basis  and  have  entries  from  Zp. 

The  conversion,  respectively  reconversion,  matrices  are  denoted  by  V  and  V~l 
(though  these  matrices  need  not  be  Vandermonde  matrices).  The  matrices  in  the  LU 
architecture  are  L and  Uc  for  conversion  and  and  Ur  for  reconversion. 

Each  of  the  Rj  below  is  a  ring  of  algebraic  integers  that  will  be  described  in  this 
section.  Each  of  the  associated  Bjk  is  a  basis  which  generates  the  algebraic-integer 
approximations  taken  from  Rj.  It  is  usually  the  case,  but  not  always,  that  each 
generates  all  of  Rj  (and  so  is  an  integral  basis  of  Rj);  any  exceptions  are  noted.  The 
standard  basis  is  always  Bj\.  The  order  of  the  Rj  is  the  order  that  the  information  is 
presented  in  this  section.  To  avoid  confusion,  i  here  always  denotes  \f— T: 
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P=  17 


i?3  =  Z[u;] ,  u>  primitive  eighth  root  of  unity,  p  =  17 
B31  =  { 1  ,  a; ,  a;2  ,  w3  } 


R\  —  Z[a>] ,  u>  primitive  sixteenth  root  of  unity,  p  —  17 
B\\  =  { 1 ,  u) ,  oT  ,  u;3  ,  a;4  ,  w5  ,  w6  ,  u;7 } 

542  =  {l,  V  2  +  \/2 ,  \/2 ,  >/2-  x/2,  i,  >/2  +  \/2i ,  \/2i ,  \/2  -  \/2i}  . 


114 


EXAMPLE  1:  Ring  Rx 
The  ring  is 

z[V2W?|  modulo  17. 

The  minimum  polynomial  of  \J2  -f  \/2  is 

f(x)  =  x4  —  4x2  4-  2 . 

The  roots  of  f(x)  modulo  17  are 

5,  8,  9,  12. 

Ring  R\  with  basis  B\\ 

The  basis  is  ^  ^  ^ 

|l,  y/2  -f  \/2 ,  {V~2 T^)“  ,  (v^T^)3}  . 

Conversion  Matrices: 


V  = 


/I 

1 

1 

1  \ 

(3 

4 

5 

1  \ 

1 5 

8 

9 

12  1 

V~l  = 

6 

5 

12 

10 

8 

13 

13 

8 

6 

12 

12 

7 

\6 

2 

15 

11/ 

\  3 

13 

5 

16/ 

L~c  = 


/  1 

0 

0 

°\ 

/I 

1 

1 

1  \ 

I  12 

1 

0 

0  | 

IT  1  0 

3 

4 

7  1 

6 

4 

1 

° 

Uc  =  0 

0 

4 

11  J 

\  14 

4 

12 

1/ 

\o 

0 

0 

16/ 

lr  = 


/  1 

0 

0 

0  ^ 

(  4 

1 

1 

1\ 

1  4 

6 

0 

0 

Ur  = 

0 

1 

7 

8  | 

1  10 

1 

13 

0 

,0 

0 

1 

7 

V  3 

13 

5 

16  ) 

u 

0 

0 

1  / 
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Ring  R\  with  basis  B\i 
The  basis  is 

Change  of  Basis  Matrices: 


{1,  vO-  y/2,  V2-V2}  . 


To  Standard  Basis 
10  0  0 

0  10  0 

-2010 
0-301 


From  Standard  Basis 
10  0  0 
0  10  0 

2  0  10 

0  3  0  1 


Conversion  Matrices: 


V  = 


/ 1 

1 

1 

1  \ 

/ 13 

7 

5 

1  \ 

1  5 

8 

9 

12  1 

v-1  -  13 

1 

12 

10 

6 

11 

11 

6 

V  "13 

16 

12 

7  J 

\8 

12 

5 

9  / 

V  13 

10 

5 

16/ 

Lc1  = 


1  0  0  0\ 
12  1  0  0 
8  4  10 

4  7  12  1  / 


UC  = 


L~r1  = 


10  0  0 
4  6  0  0 

2  1  13  0 

13  10  5  16, 


Ur  = 
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EXAMPLE  2:  Ring  R2 
The  ring  is 


Z  n/'2  +  s/2 j  modulo  31 . 

The  minimum  polynomial  of  v2  +  >/2  is 

f{x)  =  x4  —  4x2  +  2 . 
The  roots  of  /(x)  modulo  31  are 

5,  14,  17,  26. 

Ring  R2  with  basis  1 

The  basis  is 

I1' 

Conversion  Matrices: 


( 1 

1 

1 

1  \ 

r10 

2 

30  6 

\ 

5 

14 

17 

26 

I/'1  = 

6 

27 

1  20 

25 

10 

10 

25 

6 

4 

1  11 

V  i 

16 

15 

30/ 

Wo 

29 

30  25  / 

/ 1 

0 

0 

°\ 

1 

1  1  ^ 

26 

1 

0 

0 

0 

9 

12  21 

8 

12 

l 

0 

Uc  = 

° 

0 

5  4 

V  19 

21 

26 

1/ 

Vo 

0 

0  5  / 

/  1 

0 

0 

0  > 

\ 

/i 

1 

1  1  ^ 

27 

7 

0 

0 

1 

0 

1 

22  23 

14 

21 

25 

0 

= 

0 

0 

1  7 

\10 

29 

30 

25/ 

'  V  o 

0 

0  1  ) 
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Ring  /?2  with  basis  B22 


The  basis  is  _ 

{l ,  V2+V2,  y/2,  V2-V5}  ■ 


Change  of  Basis  Matrices: 


To  Standard  Basis 


From  Standard  Basis 
10  0  0 
0  10  0 

2  0  10 

0  3  0  1 


Conversion  Matrices: 


V  = 


L~R 


/ 1 

1 

1 

1  > 

(  ® 

20 

30 

6 

5 

14 

17 

26 

v~l  = 

8 

25 

1 

20 

23 

8 

8 

23 

1 

8 

6 

1 

11 

\  17 

5 

26 

14; 

U 

11 

30 

25 

/  1 

0 

0 

°\ 

(l 

1 

1 

1  V 

26 

10 

1 

12 

0 

1 

0 

0 

UC  = 

'  0 
i  0 

9 

0 

12 

5 

21 

4 

\  9 

24 

26 

\) 

u 

0 

0 

5  ) 

/  1 

0 

0 

0  > 

\ 

(\ 

1 

1 

1  V 

27 

7 

0 

0 

]  UR  = 

0 

1 

22 

23 

2 

21 

25 

0 

0 

0 

1 

7 

V  8 

11 

30 

25/ 

/ 

Vo 

0 

0 

1  ) 
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EXAMPLE  3:  Ring  R3 

The  ring  is 

Z[u>] ,  u>  primitive  eighth  root  of  unity,  modulo  17 . 
The  minimum  polynomial  of  u>  is 


f(x)  =  i4  +  1 . 


The  roots  of  f(x)  modulo  17  are 

2,  8,  15,  9. 

Ring  R3  with  basis  B3\ 

The  basis  is 

{l ,  u>,  u2  ,  u/3}  . 

Conversion  Matrices: 


V  = 


Lc 


/I 

1 

1 

1  ' 

V  / 

=  1 2 

8 

15 

9 

l  4 

13 

4 

13 

\8 

2 

9 

15  / 

/  \ 

/  1 

0 

0 

°\ 

/ 

1  15 

1 

0 

0  ) 

Uc  = 

7 

1 

0 

\15 

13 

9 

i/ 

\ 

13 

15 

16 

13 

8 

1 

13 

2 

16 

13 

9 

1 

/  1  0  0  0\ 
11  3  0  0  ] 

14  4  3  0  I 

V 13  9  1  2/ 


Ur 


/ 1  1  1  1  \ 

0  15  4 

0  0  14 

\0  0  0  1/ 
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EXAMPLE  4:  Ring 
The  ring  is 

Z[u>] ,  u)  primitive  sixteenth  root  of  unity,  modulo  17 . 
The  minimum  polynomial  of  u  is 

f(x)  =  i8  +  1 . 

The  roots  of  /( i)  modulo  17  are 

3,  10,  5,  11 ,  14,  7,  12,  6. 


Ring  i?4  with  basis  B\\ 

The  basis  is 

{  1  ,  UJ ,  u1 ,  u/3  ,  ,  a/5  ,  a/6  ,  u>7  }  . 

Conversion  Matrices: 


/ 1 

1 

1 

1 

1 

1 

1 

1  \ 

3 

10 

5 

11 

14 

7 

12 

6 

9 

15 

8 

2 

9 

15 

8 

2 

10 

14 

6 

5 

7 

3 

11 

12 

13 

4 

13 

4 

13 

4 

13 

4 

5 

6 

14 

10 

12 

11 

3 

T 

1 

15 

9 

2 

8 

15 

9 

2 

8 

V  ii 

5 

10 

3 

6 

12 

7 

14/ 

/ 15 

5 

13 

10 

9 

3 

1 

6  \ 

15 

10 

1 

12 

8 

11 

13 

3 

15 

3 

4 

11 

9 

12 

16 

10 

15 

6 

16 

3 

8 

10 

4 

5 

15 

12 

13 

7 

9 

14 

1 

11 

15 

7 

1 

5 

8 

6 

13 

14 

15 

14 

4 

6 

9 

5 

16 

7 

V  15 

11 

16 

14 

8 

7 

4 

12/ 
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1  o  0  0  0  0  0  O' 

14  1  0  0  0000 

13  4  1  0  0000 

3  10  16  1  0  0  0  0 


1  12  4 
3  3  7 


14  8 


10  0  0 
8  10  0 


13  16  5  10  14  1  1  0 


12  2  6  1 


1111  1  l  1  1  N 

0728  11  4  9  3 

0  0  7  8  10  5  1  5 

0  0  0  14  5  10  7  5 

0  0  0  0  15  11  7  9 

0000  0  8  3  13 

0000  0  0  15  4 

k0  0  0  0  0  0  0  10, 


10  0  0 

2  5  0  0 


14 

3 

5 

0 

16 

8 

6 

11 

8 

11 

15 

6 

11 

11 

3 

13 

2 

9 

6 

12 

15 

11 

16 

14 

0  0  0  0 
0  0  0  0 
0  0  0  0 


0  0  0 


'1111111  p 
0  1  10  6  4  3  11  15 

0  0  1  6  16  8  5  8 

0  0  0  1  4  8  9  4 

0  0  0  0  1  3  5  4 

0  0  0  0  0  1  11  8 

0  0  0  0  0  0  1  15 

i0  0  0  0  0  0  0  1  , 
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Ring  i?4  with  basis  B42 
The  basis  is 


Change  of  Basis  Matrices: 


To  Standard  Basis 

/  1  0  0  0  0  0  0  0  \ 

0  1  0  0  0  0  0  -1 

0  0  1  0  0  0  -1  0 

0  0  0  1  0  -1  0  0 

0  0  0  0  1  0  0  0 

0  0  0  1  0  1  0  0 

0  0  1  0  0  0  1  0 

\  0  1  0  0  0  0  0  1/ 


From  Standard  Basis 
/I  0  0  0  0  0  0  0  \ 

oioooooj 
06  \  o  o  o  £  6 

00  0  i  0  ±  6  0 

0  0  0  0  1  0  0  0 

00  0  -±  0  £  0  0 

0  o-io  0  0  |  0 

\  o  -±  o  0  00  6  \ ) 

The  fractions  in  the  second  matrix  indicate  that  this  basis  does  not  generate  all  of  R\\ 
B42  is  not  an  integral  basis  of  R4. 
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Conversion  Matrices: 
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1 
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1  \ 
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12 
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12 
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11 
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11 
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6 

6 

11 

5 

8 

9 

12 

12 

9 

8 

5 

13 

4 

13 

4 

13 

4 

13 

4 

15 

3 

3 

15 

2 

14 

14 

2 

7 

7 

10 

10 

7 

7 

10 

10 

V  14 

15 

15 

14 

3 

2 

2 

3  / 

/!5 

8 

6 

12 

9 

15 

7 

14^ 

15 

12 

11 

9 

8 

3 

7 

15 

15 

5 

11 

8 

9 

3 

10 

15 

15 

9 

6 

5 

8 

15 

10 

14 

15 

9 

6 

5 

9 

2 

7 

3 

15 
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8 
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14 

7 

2 

15 

12 

11 

9 

9 

14 

10 

2 

\  15 

8 

6 

- 

12 

8 

2 

10 

3  ) 

/  1 

0 

0 

0 

0 

0 

0 

°\ 

8 

1 

0 

0 

0 

0 

0 

0 

13 

3 

1 

0 

0 

0 

0 

0 

3 

12 

8 

1 

0 

0 

0 

0 

0 

4 

0 

14 

1 

0 

0 

0 

8 

2 

1 

i 

9 

1 

0 

0 

10 

2 

14 

2 

13 

14 

1 

0 

\  5 

14 

2 

4 

14 

12 

9 

1/ 

/I 

1 

1 

1 

1 

1 

1 

1  \ 

0 

13 

3 

16 

16 

3 

13 

0 

0 

0 

4 

14 

14 

4 

0 

0 

0 

0 

0 

12 

12 

0 

0 

0 

0 

0 

0 

0 

9 

8 

9 

8 

0 

0 

0 

0 

0 

15 

7 

8 

0 

0 

0 

0 

0 

0 

2 

10 

\0 

0 

0 

0 

0 

0 

0 

6  / 

123 
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0 

0 

0 

0 

0 

0 

°\ 

15 

4 

0 

0 

0 

0 

0 

0 

16 

5 

13 

0 

0 

0 

0 

0 

13 

1 

12 

10 

0 

0 

0 

0 

0 

8 

0 

11 

2 

0 

0 

0 

13 

16 

8 

5 

4 

8 

0 

0 

5 

1 

7 

1 

15 

7 

9 

0 

\15 

8 

6 

12 

8 

2 

10 

3  ) 

/I 

1 

1 

1 

1 

1 

1 

1  \ 

0 

1 

12 

13 

13 

12 

1 

0 

0 

0 

1 

12 

12 

1 

0 

0 

0 

0 
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1 

1 

0 

0 

0 
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1 
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0 
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0 
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\0 

0 

0 

0 

0 
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0 
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Observation  about  the  B 42  basis 

The  conversion  matrix  given  above  for  the  B. 12  basis  factors  in  a  natural  way: 

1  1  1  1  1  1  1  1  \ 

9  5  12  8  8  12  5  9 

11  6  6  11  11  6  6  11 

589  12  12  985 

13  4  13  4  13  4  13  4 

15  3  3  15  2  14  14  2 

7  7  10  10  7  7  10  10 

14  15  15  14  3  2  2  3  / 
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12 
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0 

0 

0 
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5 

8 

9 

12 

0 

0 
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13 

0 

0 
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0 

0 

0 
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6 

11 

11 

6 

13 

0 

0 

0 

0 

0 

0 

4 

\0 

0 

0 

0 

8 

12 

5 

9  ) 

\  0 

0 

13 

0 

0 

4 

0 

0/ 

The  4x4  blocks  in  the  left  matrix  of  the  product  are  the  conversion  matrices  for  the  B\2 
bases.  This  is  because  the  B\i  basis  can  be  written  as  the  union  of  a  B 12  basis  and  a 
multiple  of  a  B\2  basis: 

B42  =  {l,  y/2  +  >/2,  V2,  y/2  -  V2,  i,  y/2  +  y/2i ,  V2i ,  y/2  -  y/2i^ 


{l,  y/2  +  V2,  y/2,  y/2 -y/2}  U  {i,  v/2  +  n/2z  ,  n/2i,  n/2  -  v/2i}  . 

Because  of  this  basis  factorization,  two  4x4  conversions  could  be  used  instead 
of  one  8x8  conversion.  The  4x4  conversions  can  occur  in  parallel,  but  need  to 
be  followed  by  some  simple  modulo  17  additions  representing  multiplication  by  the 
sparse  right  matrix  of  the  product.  The  factorization  of  bases  and  the  corresponding 
factorization  of  the  conversion  matrices  will  be  the  subject  of  further  investigation. 
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i  I 

\  MISSION  2 

\  .  of 

§  Rome  Air  Development  Center  t 

\  l 

N  RADC  plans  and  executes  research,  development,  test  and  selected  3 

%  acquisition  programs  in  support  of  Command,  Control,  Communications  % 

&  and  Intelligence  ( C3!)  activities.  Technical  and  engineering  support  within  S 

»  areas  of  competence  is  provided  to  ESD  Program  Offices  (POs)  and  other  £ 

ft  ESD  elements  to  perform  effective  acquisition  of  cV  systems.  The  areas  ^ 

v  of  technical  competence  include  communications,  command  and  control,  C 

^  battle  management,  information  processing,  surveillance  sensors,  v 

•>  intelligence  data  collection  and  handling,  solid  state  sciences,  Q 

»  electromagnetics,  and  propagation,  and  electronic,  maintainability,  and 
^  compatibility.  ^ 


